Cloud Models

Bring the model setup you already like into Snaply. The selected model powers Writing Assistant, meeting titles, AI meeting notes, and Meeting Notes Ask AI. Use the local model when you want the most private, offline-friendly default. Switch to a cloud model when you want a specific provider, a larger model, stronger reasoning, or the same model your team already uses elsewhere. This is the right page if you are looking for broader model support, OpenRouter, Claude/OpenAI/Gemini-style presets, Mistral, or a local OpenAI-compatible setup with Ollama, LM Studio, or LiteLLM.

Supported Providers

Snaply includes built-in provider setup for:

OpenAI
Anthropic
Google Gemini
OpenRouter
xAI
Groq
Mistral
OpenAI-compatible endpoint

The catalog contains hundreds of models. Snaply keeps the first choice simple by showing recommended models first, then gives you search when you want to pick a specific model.

Common Requested Setups

What you want	Where to start
OpenAI, Claude, Gemini, Mistral, Groq, xAI, or OpenRouter	Pick the provider in Advanced model setup
A local Ollama model	Use OpenAI-compatible endpoint with `http://localhost:11434/v1`
LM Studio’s local server	Use OpenAI-compatible endpoint with `http://localhost:1234/v1`
LiteLLM, a private proxy, or a company gateway	Use OpenAI-compatible endpoint with that server’s `/v1` base URL
One API key for many model families	Use OpenRouter

The Simple Setup

Open Settings -> Writing Assistant -> AI Models.
Go to Advanced model setup.
Pick the provider you want.
Choose a recommended model, or search the full catalog for a specific one.
Click Add API key and paste the provider key.
Select Use on the model you want.

That is it. From then on, Writing Assistant, meeting titles, AI notes, and Ask AI use that model. Your provider account still controls pricing, rate limits, billing, and data handling for that provider.

OpenAI-Compatible Endpoint

Choose OpenAI-compatible endpoint when you want Snaply to talk to something custom: Ollama, LM Studio, LiteLLM, a local model server, private proxy, company gateway, or hosted provider with an OpenAI-style API. Snaply asks for:

Base URL: the endpoint root, for example http://localhost:11434/v1
Model name: the exact model identifier your endpoint expects
API key: optional, only needed if the endpoint requires a bearer token

After you save it, the endpoint behaves like any other AI model in Snaply.

Ollama, LM Studio, And LiteLLM

This is the easiest way to bring local models you already run elsewhere into Snaply.

Tool	Base URL to try	Model name
Ollama	`http://localhost:11434/v1`	The model you pulled in Ollama, such as `llama3.2` or `llama3.1:8b`
LM Studio	`http://localhost:1234/v1`	The model identifier shown by LM Studio’s local server
LiteLLM or another OpenAI-compatible server	Your server’s `/v1` endpoint	The model name that server expects

For local tools like Ollama and LM Studio, you can usually leave API key empty unless you turned on authentication yourself. If you type http://localhost:11434 or http://localhost:1234, Snaply can suggest the matching /v1 URL.

API Key Storage

Cloud API keys are stored in Snaply’s encrypted app vault on your Mac. From the API key dialog, you can:

Add a key for each provider
See which providers have a key saved
Reveal a saved key when needed
Remove a key
Load previously saved keys from older Snaply builds

OpenAI-compatible endpoints can be used with or without a key, depending on how your endpoint is configured.

What Gets Sent To A Cloud Model

When a cloud model is selected, Snaply sends only the text needed for the request to that provider. That can include:

Selected text for Writing Assistant edits
The prompt and chat history for Writing Assistant chat
Meeting transcript content for generated titles and notes
Meeting transcript and notes context for Ask AI

Meeting recordings and transcripts are still stored locally by Snaply. The important distinction is generation: if the selected model is cloud, the text needed for that generation goes to the selected provider. For OpenAI-compatible endpoints, the text goes to the endpoint URL you configured. If that endpoint is Ollama or LM Studio running on localhost, the request stays on your machine. If it is LiteLLM, a hosted gateway, or a company proxy, the request goes there.

When To Use Each Option

Use case	Good choice
Private offline rewriting	Local Daily Assistant model
Strong general writing and reasoning	OpenAI, Anthropic, or Google Gemini
Access to many model families from one key	OpenRouter
Fast hosted inference	Groq
Mistral or Codestral workflows	Mistral
Grok models	xAI
Ollama, LM Studio, LiteLLM, local server, proxy, or custom gateway	OpenAI-compatible endpoint

Troubleshooting

If a cloud model cannot run, Snaply shows an error card with the next step. Most issues come down to one of these:

Add the missing API key
Choose another model from the same provider
Check quota, billing, or rate limits in the provider account
For OpenAI-compatible endpoints, confirm the base URL, model name, and bearer token

Overview

Getting Started

Voice Dictation

Writing Assistant

Meeting Notes

Supported Providers

Common Requested Setups

The Simple Setup

OpenAI-Compatible Endpoint

Ollama, LM Studio, And LiteLLM

API Key Storage

What Gets Sent To A Cloud Model

When To Use Each Option

Troubleshooting

Overview

Getting Started

Voice Dictation

Writing Assistant

Meeting Notes

Documentation Index

​Supported Providers

​Common Requested Setups

​The Simple Setup

​OpenAI-Compatible Endpoint

​Ollama, LM Studio, And LiteLLM

​API Key Storage

​What Gets Sent To A Cloud Model

​When To Use Each Option

​Troubleshooting

​Related Pages

Supported Providers

Common Requested Setups

The Simple Setup

OpenAI-Compatible Endpoint

Ollama, LM Studio, And LiteLLM

API Key Storage

What Gets Sent To A Cloud Model

When To Use Each Option

Troubleshooting

Related Pages