https://python.langchain.com/docs/concepts/chat_models/
- Modern LLMs are typically accessed through a chat model interface that takes a list of messages as input and returns a message as output.
- Additional capabilities offered by the newest generation of chat models include:
- Tool calling: Many popular chat models offer a native tool calling API. This API allows developers to build rich applications that enable LLMs to interact with external services, APIs, and databases. Tool calling can also be used to extract structured information from unstructured data and perform various other tasks. (A minimal sketch of tool calling and structured output follows this list.)
- Structured output: A technique to make a chat model respond in a structured format, such as JSON matching a given schema.
- Multimodality: The ability to work with data other than text; for example, images, audio, and video.
- https://medium.com/@ihamzakhan89/langchain-showdown-llms-vs-chat-models-ebf044e6827d
- Features:
- Integrations with many chat model providers (e.g., Anthropic, OpenAI, Ollama, Microsoft Azure, Google Vertex, Amazon Bedrock, Hugging Face, Cohere, Groq). Please see chat model integrations for an up-to-date list of supported models. (See the initialization sketch after this list.)
- Use either LangChain's messages format or OpenAI format.
- Standard tool calling API: a standard interface for binding tools to models, accessing tool call requests made by models, and sending tool results back to the model.
- Standard API for structuring outputs via the with_structured_output method.
- Support for async programming, efficient batching, and a rich streaming API.
- Integration with LangSmith for monitoring and debugging production-grade applications based on LLMs.
- Additional features like standardized token usage, rate limiting, caching, and more.
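As a sketch of how the provider integrations are used interchangeably, recent langchain releases ship an `init_chat_model` helper; the model identifiers below are illustrative, and each provider needs its integration package and API key:

```python
# Provider-agnostic initialization (a sketch; requires langchain plus the
# matching provider packages, e.g. langchain-openai / langchain-anthropic).
from langchain.chat_models import init_chat_model

gpt = init_chat_model("gpt-4o-mini", model_provider="openai")
claude = init_chat_model("claude-3-5-sonnet-20240620", model_provider="anthropic")

print(gpt.invoke("Hello!").content)
```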
- These integrations are one of two types:
- Official models: These are models that are officially supported by LangChain and/or the model provider. You can find these models in the langchain-<provider> packages.
- Community models: These are models that are mostly contributed and supported by the community. You can find these models in the langchain-community package.
- LangChain chat models implement the BaseChatModel interface. Because BaseChatModel also implements the Runnable interface, chat models support a standard streaming interface, async programming, optimized batching, and more.
- Chat models offer a standard set of parameters that can be used to configure the model. These parameters are typically used to control the behavior of the model, such as the temperature of the output, the maximum number of tokens in the response, and the maximum time to wait for a response.
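A sketch of those standard parameters using langchain-openai; most provider integrations standardize these names, and the values below are illustrative:

```python
# Standard parameters in action (a sketch; values are illustrative).
from langchain_openai import ChatOpenAI

model = ChatOpenAI(
    model="gpt-4o-mini",  # which underlying model to run
    temperature=0,        # randomness of sampling; 0 is near-deterministic
    max_tokens=256,       # cap on tokens generated in the response
    timeout=30,           # max seconds to wait for a response
    max_retries=2,        # retries on transient API errors
)
```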
<aside>
Note:
In documentation, we will often use the terms "LLM" and "Chat Model" interchangeably. This is because most modern LLMs are exposed to users via a chat model interface.
However, LangChain also has implementations of older LLMs that do not follow the chat model interface and instead use an interface that takes a string as input and returns a string as output. These models are typically named without the "Chat" prefix (e.g., Ollama, Anthropic, OpenAI, etc.). These models implement the BaseLLM interface and may be named with the "LLM" suffix (e.g., OllamaLLM, AnthropicLLM, OpenAILLM, etc.). Generally, users should not use these models.
</aside>
Key methods:
The key methods of a chat model are:
- invoke: The primary method for interacting with a chat model. It takes a list of messages as input and returns a message as output.
- stream: A method that allows you to stream the output of a chat model as it is generated.
- batch: A method that allows you to batch multiple requests to a chat model together for more efficient processing.
- bind_tools: A method that allows you to bind a tool to a chat model for use in the model's execution context.
- with_structured_output: A wrapper around the invoke method for models that natively support structured output.
Other important methods can be found in the BaseChatModel API Reference.
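A sketch exercising the core methods on a single model instance (assumes a `model` built as in the earlier examples; tool binding and structured output were shown above):

```python
# invoke: one request in, one AI message out (a plain string is coerced
# to a human message).
reply = model.invoke("Give me a one-line joke.")
print(reply.content)

# stream: yields message chunks as they are generated.
for chunk in model.stream("Give me a one-line joke."):
    print(chunk.content, end="", flush=True)

# batch: several inputs processed together, with concurrency managed for you.
replies = model.batch(["Capital of France?", "Capital of Japan?"])

# Async variants exist for each method (ainvoke, astream, abatch).
import asyncio
print(asyncio.run(model.ainvoke("Capital of Italy?")).content)
```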
LangChain supports two message formats to interact with chat models:
- LangChain Message Format: LangChain's own message format, which is used by default and internally by LangChain.
- OpenAI's Message Format: the message format used by OpenAI's API, which LangChain also accepts.
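Both formats can be passed straight to invoke; a sketch, again assuming a `model` instance as above:

```python
from langchain_core.messages import HumanMessage, SystemMessage

# LangChain message format: typed message objects.
reply = model.invoke([
    SystemMessage(content="You are a terse assistant."),
    HumanMessage(content="What is LangChain?"),
])

# OpenAI message format: plain dicts with `role` and `content` keys.
reply = model.invoke([
    {"role": "system", "content": "You are a terse assistant."},
    {"role": "user", "content": "What is LangChain?"},
])
print(reply.content)
```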
Standard parameters