MCP Marketplace

Quelllm MCP Server

by MGM FALCON
Developer Tools · Low Risk · 10.0 · MCP Registry · Local
Free

Server data from the Official MCP Registry

MCP server for quelllm.fr: 190+ open-weights LLM catalog - list, compare, VRAM and cost estimates

Security Report

10.0 · Low Risk

Valid MCP server (2 strong, 4 medium validity signals). No known CVEs in dependencies. Imported from the Official MCP Registry.

7 files analyzed · No issues found

Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.

Permissions Required

This plugin requests these system permissions. Most are normal for its category.

file_system

Check that this permission is expected for this type of plugin.

Documentation

From the project's GitHub README.

quelllm-mcp

MCP server exposing the quelllm.fr catalog of 190+ open-weights LLMs via Model Context Protocol tools. Use it from Claude Code, Cursor, Continue, or any MCP-compatible client to query models, compare them, estimate VRAM, and compute API vs self-hosted cost.

Tools exposed

| Tool | Description |
| --- | --- |
| list_models(filter_origin?, filter_family?, max_params_b?) | List models with filters (origin code, family, max params in B) |
| get_model(model_id) | Full record for one model (params, vram per quant, context window, family, tags, license, URLs) |
| compare(model_a_id, model_b_id) | Side-by-side comparison with verdict |
| estimate_vram(model_id, quant) | VRAM in GB at chosen quant + recommended GPU/Mac tiers |
| estimate_cost(input_tokens_per_month, output_tokens_per_month, ...) | Cost in EUR: full table API providers vs self-hosted hardware OR a specific id |
| search_models(query, limit?) | Fuzzy search by name, family, tag, author |
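
As a rough cross-check of the numbers estimate_vram reports, weight memory can be approximated from the parameter count and the bits stored per weight. The sketch below is a back-of-envelope rule, not the server's implementation: the bits-per-weight table and the 20% overhead factor are assumptions chosen for illustration, and real usage also depends on context length and runtime.

```python
# Back-of-envelope VRAM estimate. NOT the quelllm-mcp implementation:
# the bits-per-weight values and the overhead factor are illustrative assumptions.
ASSUMED_BITS_PER_WEIGHT = {"q4": 4.0, "q8": 8.0, "fp16": 16.0}


def rough_vram_gb(params_b: float, quant: str, overhead: float = 1.2) -> float:
    """Approximate VRAM in GB for a model with `params_b` billion parameters."""
    bits = ASSUMED_BITS_PER_WEIGHT[quant]
    # 1e9 parameters times (bits / 8) bytes each is (params_b * bits / 8) GB.
    weights_gb = params_b * bits / 8
    # Assumed headroom for KV cache, activations and runtime buffers.
    return round(weights_gb * overhead, 1)


if __name__ == "__main__":
    for quant in ("q4", "q8", "fp16"):
        print(quant, rough_vram_gb(24, quant), "GB")
```

Under these assumptions a 24B model at q4 lands near 14.4 GB, so treat the catalog's per-quant values as the reference and this rule only as a sanity check.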

Install

Install from source (not yet on PyPI):

pip install git+https://github.com/MGM-FALCON/quelllm-mcp.git

Or run without installing, using uv:

uvx --from git+https://github.com/MGM-FALCON/quelllm-mcp.git quelllm-mcp

For local development:

git clone https://github.com/MGM-FALCON/quelllm-mcp.git
cd quelllm-mcp
pip install -e .

Use with Claude Code

Add to ~/.claude.json or a project's .mcp.json. If you installed with pip:

{
  "mcpServers": {
    "quelllm": {
      "command": "quelllm-mcp"
    }
  }
}

Or zero-install with uvx:

{
  "mcpServers": {
    "quelllm": {
      "command": "uvx",
      "args": ["--from", "git+https://github.com/MGM-FALCON/quelllm-mcp.git", "quelllm-mcp"]
    }
  }
}
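
If a config file already lists other servers, hand-editing can clobber them. The following is a minimal sketch of merging the entry programmatically; the helper name add_quelllm_server is hypothetical and not part of quelllm-mcp, and it assumes the file, when present, contains valid JSON.

```python
import json
import tempfile
from pathlib import Path


def add_quelllm_server(config_path: Path, use_uvx: bool = False) -> dict:
    """Add a "quelllm" entry under "mcpServers", keeping any existing entries.

    Hypothetical helper for illustration; not shipped with quelllm-mcp.
    """
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    servers = config.setdefault("mcpServers", {})
    if use_uvx:
        # Zero-install variant, same values as the uvx JSON shown above.
        servers["quelllm"] = {
            "command": "uvx",
            "args": ["--from", "git+https://github.com/MGM-FALCON/quelllm-mcp.git", "quelllm-mcp"],
        }
    else:
        # pip-installed variant: the console script is on PATH.
        servers["quelllm"] = {"command": "quelllm-mcp"}
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config


if __name__ == "__main__":
    # Demonstrate against a throwaway file rather than a real config.
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / ".mcp.json"
        add_quelllm_server(path, use_uvx=True)
        print(path.read_text())
```

Pointing config_path at a project's .mcp.json would apply the same merge to a real config; back the file up first.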

Use with Claude Desktop

Edit ~/Library/Application Support/Claude/claude_desktop_config.json (macOS):

{
  "mcpServers": {
    "quelllm": {
      "command": "quelllm-mcp"
    }
  }
}

Use with Cursor / Continue / Cline

Most MCP clients accept the same JSON config:

{
  "command": "quelllm-mcp"
}

Example queries (from your client)

> Which Mistral LLMs can run on an RTX 5070 Ti 16GB?
→ list_models(filter_family='Mistral', max_params_b=24)
→ estimate_vram('mistral-small-24b', 'q4')

> Compare Llama 3.3 70B vs Qwen 2.5 32B
→ compare('llama33-70b', 'qwen25-32b')

> I use 10M input tokens + 2.5M output tokens per month. How much do I pay with OpenAI vs DeepSeek?
→ estimate_cost(10_000_000, 2_500_000)
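
The API side of a cost estimate like the last query reduces to simple per-million-token arithmetic. Here is a sketch of that calculation only; the provider names and prices are placeholders invented for illustration, not the values hardcoded in quelllm-mcp or any provider's actual rates.

```python
# Hypothetical per-million-token prices in EUR. Placeholders for illustration,
# NOT the prices hardcoded in quelllm-mcp and NOT real provider rates.
PLACEHOLDER_PRICES_EUR = {
    "provider_a": {"input": 2.00, "output": 8.00},
    "provider_b": {"input": 0.50, "output": 1.50},
}


def monthly_api_cost_eur(input_tokens: int, output_tokens: int, price: dict) -> float:
    """Monthly cost = tokens (in millions) times the per-million price, summed."""
    cost = (
        input_tokens / 1_000_000 * price["input"]
        + output_tokens / 1_000_000 * price["output"]
    )
    return round(cost, 2)


if __name__ == "__main__":
    for name, price in PLACEHOLDER_PRICES_EUR.items():
        print(name, monthly_api_cost_eur(10_000_000, 2_500_000, price), "EUR/month")
```

With the placeholder prices, 10M input and 2.5M output tokens come to 40.0 EUR for provider_a and 8.75 EUR for provider_b.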

Data source

All data is pulled from quelllm.fr/api/ (CC BY 4.0, no key, CORS-enabled). Responses are cached locally for 1 hour to avoid rate limiting.
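
A one-hour cache of this kind can be sketched as a small time-to-live wrapper around the fetch call. The class below is an assumption for illustration: the README does not say whether the real cache lives in memory or on disk, and the names TTLCache and get_or_fetch are hypothetical.

```python
import time
from typing import Any, Callable, Dict, Tuple


class TTLCache:
    """In-memory cache whose entries expire after `ttl_seconds`.

    Illustrative only; the actual quelllm-mcp cache may work differently.
    """

    def __init__(self, ttl_seconds: float = 3600.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._entries: Dict[str, Tuple[float, Any]] = {}

    def get_or_fetch(self, key: str, fetch: Callable[[], Any]) -> Any:
        now = self._clock()
        hit = self._entries.get(key)
        if hit is not None and now - hit[0] < self._ttl:
            return hit[1]  # still fresh: skip the network call
        value = fetch()
        self._entries[key] = (now, value)
        return value


if __name__ == "__main__":
    cache = TTLCache()
    first = cache.get_or_fetch("models", lambda: ["fetched once"])
    second = cache.get_or_fetch("models", lambda: ["never called while fresh"])
    print(first is second)
```

Injecting the clock keeps the expiry logic testable without waiting an hour.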

API pricing data (GPT-5, Claude Opus 4.7, Gemini 2.5, DeepSeek, Mistral) and hardware pricing (RTX 50-series, Mac M4) are hardcoded as of 2026-05; verify them every six months.

License

MIT — see LICENSE.

Contributing

Source: https://github.com/MGM-FALCON/quelllm-mcp. Issues and PRs are welcome, particularly:

  • API pricing updates (every six months)
  • Hardware additions (new GPUs, Mac Mx series)
  • New tools (e.g. find_alternatives_to(model_id), recommend_gpu(budget_eur))

Tests

A pytest smoke suite lives under tests/. It covers all 6 tools and the v1.1.0 output invariants, never touches the network (local fixture + mocked httpx), and stubs the mcp SDK when it isn't importable — so it also runs on Python 3.9.

pip install -e ".[test]"
pytest
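
To give a feel for a network-free test, here is a sketch that feeds a tool function a local fixture instead of a live response. It swaps the mocked-httpx approach described above for a simpler injected callable; the fixture records, their field names, and this list_models signature are hypothetical and do not mirror the project's actual code.

```python
from unittest import mock

# Hypothetical fixture records; field names are assumptions for illustration.
FIXTURE = [
    {"id": "demo-7b", "family": "Demo", "params_b": 7},
    {"id": "demo-70b", "family": "Demo", "params_b": 70},
]


def list_models(fetch, max_params_b=None):
    """Simplified stand-in for the list_models tool; `fetch` supplies the catalog."""
    models = fetch()
    if max_params_b is not None:
        models = [m for m in models if m["params_b"] <= max_params_b]
    return models


def test_list_models_offline():
    # The mock replaces the HTTP call, so the test never touches the network.
    fake_fetch = mock.Mock(return_value=FIXTURE)
    result = list_models(fake_fetch, max_params_b=30)
    fake_fetch.assert_called_once_with()
    assert [m["id"] for m in result] == ["demo-7b"]


if __name__ == "__main__":
    test_list_models_offline()
    print("ok")
```

Saved as a file under tests/, pytest would collect test_list_models_offline automatically.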

Author

Mohamed Meguedmi (LinkedIn · Hugging Face), founder of La Gazette IA and QuelLLM.fr.

Details

Published July 1, 2026
Version 1.1.0
0 installs
Local Plugin
