* feat: add Cerebras as an LLM provider
Adds Cerebras (https://www.cerebras.ai) as a first-class LLM provider for
chat and agents. Cerebras exposes an OpenAI-compatible inference API, so the
integration mirrors the existing fast-inference providers (Groq/SambaNova):
- New `CerebrasLLM` provider with streaming, prompt-window limits sourced from
the shared model map, and dynamic model discovery via the customModels
helper (`GET /v1/models`).
- New `CerebrasProvider` agent provider with the standard UnTooled fallback and
optional native OpenAI-compatible tool calling, gated behind the existing
`PROVIDER_SUPPORTS_NATIVE_TOOL_CALLING` ENV (since tool-calling support
varies per Cerebras model).
- Wires the provider through getLLMProvider/getLLMProviderClass, agent
selection, ENV validation/persistence, system settings, and the agent
sub-task client builder so agent helpers (e.g. web-scraping summarization)
work for Cerebras.
- Registers `cerebras` in the context-window model map (tracked LiteLLM
provider + legacy fallback values) for accurate context limits.
- Frontend: API key + model-selection options, onboarding/settings provider
entries, agent-provider enablement, privacy-policy entry, and provider logo.
- Documents the new `CEREBRAS_API_KEY` / `CEREBRAS_MODEL_PREF` env vars in the
example env files and adds Cerebras to the supported-provider lists.
resolves #5605
* use proper model fallback
use public endpoint for model fetching
* context window reporting + better chat responses + show thinking
* add Cerebras agent tool calling, capability detection, context windows, and usage reporting
* add safety caching
* stub out images
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
* add MiniMax LLM provider support
* add MiniMax LLM provider to the docs
* fix: add trailing newlines for lint compliance
* add env vars to system settings | make max tokens configurable via ui | pass max tokens into minimax llm provider
* change fallback max tokens value to null to use provider default | pass max tokens into handleFunctionCallStream and chat
* add minimax to getModelTag switch
* pass provider into tooledStream and tooledComplete
* remove max tokens param
* update image
---------
Co-authored-by: angelplusultra <macfittondev@gmail.com>
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
* Improve DMR (Docker Model Runner) support
- Autodetect installed models
- Fetch all models from hub.docker to show what is available
- UI to handle rendering, searching, installing, and managing models
- Support functionality for chat, stream, and agentic calls
* forgot files
* fix loader circle being too large
fix tooltip width command
adjust location of docker installer open for web platform
* adjust imports
* feat: Implement CometAPI integration for chat completions and model management
- Added CometApiLLM class for handling chat completions using CometAPI.
- Implemented model synchronization and caching mechanisms.
- Introduced streaming support for chat responses with timeout handling.
- Created CometApiProvider class for agent interactions with CometAPI.
- Enhanced error handling and logging throughout the integration.
- Established a structure for managing function calls and completions.
* linting
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
* PGVector support for vector db storage
* forgot files
* comments
* dev build
* Add ENV connection and table schema validations for vector table
add .reset call to drop embedding table when changing the AnythingLLM embedder
update instructions
Add preCheck error reporting in UpdateENV
add timeout to pg connection
* update setup
* update README
* update doc
* feat: add new model provider PPIO
* fix: fix ppio model fetching
* fix: code lint
* reorder LLM
update interface for streaming and chats to use valid keys
linting
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
fix:
docs(zh-CN): fix text and format issues in Chinese documentation
- Remove redundant punctuation and repeated words
- Standardize quotation marks format and remove unnecessary quotes
- Add and complete TTS and STT related translations
* feat: add new model provider: Novita AI
* feat: finished novita AI
* fix: code lint
* remove unneeded logging
* add back log for novita stream not self closing
* Clarify ENV vars for LLM/embedder separation for future
Patch ENV check for workspace/agent provider
---------
Co-authored-by: Jason <ggbbddjm@gmail.com>
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
* Issue #1943: Add support for LLM provider - Fireworks AI
* Update UI selection boxes
Update base AI keys for future embedder support if needed
Add agent capabilities for FireworksAI
* class only return
---------
Co-authored-by: Aaron Van Doren <vandoren96+1@gmail.com>