AI Engineering World Fair
What's new?
Last year, we saw a lot of interest in the use of LLMs for new use cases. This year, with more funding and interest in the space, we've finally started thinking about productionizing these models at scale and making sure that they're reliable, consistent and secure.
Let's start with a few definitions
-
Agent : This is a LLM which is provided with a few tools it can call. The agentic part of this system comes from the ability to make decisions based on some input. This is similar to Harrison Chase's article here
-
Evaluations : A set of metrics that we can look at to understand where our current system falls short. An example could be measuring precision and recall.
-
Synthethic Data Generation: Data generated by a LLM which is meant to mimic real data