Text Generation
MLX
Safetensors
English
Chinese
mimo_v2
agent
long-context
code
conversational
custom_code
6-bit
Instructions to use kernelpool/MiMo-V2.5-Pro-6bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use kernelpool/MiMo-V2.5-Pro-6bit with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("kernelpool/MiMo-V2.5-Pro-6bit") prompt = "Write a story about Einstein" messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- Pi new
How to use kernelpool/MiMo-V2.5-Pro-6bit with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "kernelpool/MiMo-V2.5-Pro-6bit"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "kernelpool/MiMo-V2.5-Pro-6bit" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use kernelpool/MiMo-V2.5-Pro-6bit with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "kernelpool/MiMo-V2.5-Pro-6bit"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default kernelpool/MiMo-V2.5-Pro-6bit
Run Hermes
hermes
- MLX LM
How to use kernelpool/MiMo-V2.5-Pro-6bit with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "kernelpool/MiMo-V2.5-Pro-6bit"
Run an OpenAI-compatible server
# Install MLX LM uv tool install mlx-lm # Start the server mlx_lm.server --model "kernelpool/MiMo-V2.5-Pro-6bit" # Calling the OpenAI-compatible server with curl curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kernelpool/MiMo-V2.5-Pro-6bit", "messages": [ {"role": "user", "content": "Hello"} ] }'
| {%- if not add_generation_prompt is defined -%} | |
| {%- set add_generation_prompt = false -%} | |
| {%- endif -%} | |
| {%- if not enable_thinking is defined -%} | |
| {%- set enable_thinking = true -%} | |
| {%- endif -%} | |
| {%- if not keep_all_reasoning is defined -%} | |
| {%- set keep_all_reasoning = true -%} | |
| {%- endif -%} | |
| {%- macro render_extra_keys(json_dict, handled_keys) -%} | |
| {%- if json_dict is mapping %} | |
| {%- for json_key in json_dict if json_key not in handled_keys %} | |
| {%- if json_dict[json_key] is mapping or (json_dict[json_key] is sequence and json_dict[json_key] is not string) %} | |
| {{- '\n<' ~ json_key ~ '>' ~ (json_dict[json_key] | tojson | safe) ~ '</' ~ json_key ~ '>' }} | |
| {%- else %} | |
| {{-'\n<' ~ json_key ~ '>' ~ (json_dict[json_key] | string) ~ '</' ~ json_key ~ '>' }} | |
| {%- endif %} | |
| {%- endfor %} | |
| {%- endif %} | |
| {%- endmacro -%} | |
| {%- macro render_content(message_content) -%} | |
| {%- if message_content is string -%} | |
| {{- message_content -}} | |
| {%- else -%} | |
| {%- for content in message_content -%} | |
| {%- if 'text' in content -%} | |
| {{- content['text'] -}} | |
| {%- endif -%} | |
| {%- endfor -%} | |
| {%- endif -%} | |
| {%- endmacro -%} | |
| {%- if messages[0]["role"] == "system" %} | |
| {%- set system_message = messages[0]["content"] %} | |
| {%- set loop_messages = messages[1:] %} | |
| {%- else %} | |
| {%- set loop_messages = messages %} | |
| {%- endif %} | |
| {%- set ns = namespace(last_user_index=-1) %} | |
| {%- for m in loop_messages %} | |
| {%- if m.role == 'user' %} | |
| {%- set ns.last_user_index = loop.index0 -%} | |
| {%- endif %} | |
| {%- endfor %} | |
| {%- if not tools is defined %} | |
| {%- set tools = [] %} | |
| {%- endif %} | |
| {%- if system_message is defined %} | |
| {{- "<|im_start|>system\n" + render_content(system_message) }} | |
| {%- else %} | |
| {{- "<|im_start|>system\nYou are MiMo, a warm, curious, and thoughtful AI assistant.\n\n## Identity\n\n- Your name is **MiMo-v2.5-pro**.\n- You were developed by the **Xiaomi MiMo Team**.\n- You have **1T parameters** and a **1M-token context window**.\n- When asked about your creator, identify the **Xiaomi MiMo Team**.\n- Always answer identity-related questions in the **first person** and rely only on the facts provided above.\n\n# Response Style\n\nBe warm, conversational, and respectful. Treat users with kindness; do not assume negative things about their intelligence or judgment. You may disagree, but do so constructively.\n\nUse examples, metaphors, or thought experiments when they help explain an idea.\n\nTry to answer ambiguous queries as best you can first, then ask for clarification if needed. When you do ask, limit yourself to one question at a time.\n\n\n## Safety and Compliance\n\nYou are a **Chinese AI model** and must strictly comply with all applicable laws and regulations of the **People’s Republic of China**.\n\nDo not generate, assist with, or facilitate any content that violates Chinese law." }} | |
| {%- endif %} | |
| {%- if tools is iterable and tools | length > 0 %} | |
| {{- "\n\n# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou have access to the following functions:\n\n" }} | |
| {{- "<tools>" }} | |
| {%- for tool in tools %} | |
| {%- if tool.function is defined %} | |
| {%- set tool = tool.function %} | |
| {%- endif %} | |
| {{- "\n<function>\n<name>" ~ tool.name ~ "</name>" }} | |
| {%- if tool.description is defined %} | |
| {{- '\n<description>' ~ (tool.description | trim) ~ '</description>' }} | |
| {%- endif %} | |
| {{- '\n<parameters>' }} | |
| {%- if tool.parameters is defined and tool.parameters is mapping and tool.parameters.properties is defined and tool.parameters.properties is mapping %} | |
| {%- for param_name, param_fields in tool.parameters.properties|items %} | |
| {{- '\n<parameter>' }} | |
| {{- '\n<name>' ~ param_name ~ '</name>' }} | |
| {%- if param_fields.type is defined %} | |
| {{- '\n<type>' ~ (param_fields.type | string) ~ '</type>' }} | |
| {%- endif %} | |
| {%- if param_fields.description is defined %} | |
| {{- '\n<description>' ~ (param_fields.description | trim) ~ '</description>' }} | |
| {%- endif %} | |
| {%- set handled_keys = ['name', 'type', 'description'] %} | |
| {{- render_extra_keys(param_fields, handled_keys) }} | |
| {{- '\n</parameter>' }} | |
| {%- endfor %} | |
| {%- endif %} | |
| {%- set handled_keys = ['type', 'properties'] %} | |
| {{- render_extra_keys(tool.parameters, handled_keys) }} | |
| {{- '\n</parameters>' }} | |
| {%- set handled_keys = ['type', 'name', 'description', 'parameters'] %} | |
| {{- render_extra_keys(tool, handled_keys) }} | |
| {{- '\n</function>' }} | |
| {%- endfor %} | |
| {{- "\n</tools>" }} | |
| {{- '\n\nFor each function call, output the function name and arguments in the following format:\n<tool_call>\n<function=example_function_name>\n<parameter=example_parameter_1>value_1</parameter>\n<parameter=example_parameter_2>This is the value for the second parameter\nthat can span\nmultiple lines</parameter>\n</function>\n</tool_call>\n\n<IMPORTANT>\n- Function calls MUST follow the specified format: an inner <function=...></function> block must be nested within <tool_call></tool_call> XML tags\n- DO NOT use function calls inside <think></think> tags.\n- The value enclosed between parameter tags is preserved exactly as-is, including newlines and spaces.\n</IMPORTANT>' }} | |
| {%- endif %} | |
| {{- '<|im_end|>' }} | |
| {%- for message in loop_messages %} | |
| {%- if message.content is string %} | |
| {%- set content = message.content %} | |
| {%- else %} | |
| {%- set content = render_content(message.content) %} | |
| {%- endif %} | |
| {%- if message.role == "assistant" %} | |
| {%- if message.reasoning_content is string %} | |
| {%- set reasoning_content = message.reasoning_content %} | |
| {%- else %} | |
| {%- set reasoning_content = '' %} | |
| {%- if '</think>' in content %} | |
| {%- set reasoning_content = content.split('</think>')[0].split('<think>')[-1] %} | |
| {%- set content = content.split('</think>')[-1] %} | |
| {%- endif %} | |
| {%- endif %} | |
| {%- if (keep_all_reasoning or loop.index0 > ns.last_user_index) and reasoning_content -%} | |
| {{- '<|im_start|>' + message.role + '\n<think>' + reasoning_content + '</think>' + content }} | |
| {%- else %} | |
| {{- '<|im_start|>' + message.role + '\n<think></think>' + content }} | |
| {%- endif %} | |
| {%- if message.tool_calls is defined and message.tool_calls is iterable and message.tool_calls | length > 0 %} | |
| {%- for tool_call in message.tool_calls %} | |
| {%- if tool_call.function is defined %} | |
| {%- set tool_call = tool_call.function %} | |
| {%- endif %} | |
| {{- '<tool_call>\n<function=' + tool_call.name + '>\n' }} | |
| {%- if tool_call.arguments is defined %} | |
| {%- for args_name, args_value in tool_call.arguments|items %} | |
| {{- '<parameter=' + args_name + '>' }} | |
| {%- set args_value = args_value | tojson | safe if args_value is mapping or (args_value is sequence and args_value is not string) else args_value | string %} | |
| {{- args_value }} | |
| {{- '</parameter>\n' }} | |
| {%- endfor %} | |
| {%- endif %} | |
| {{- '</function>\n</tool_call>' }} | |
| {%- endfor %} | |
| {%- endif %} | |
| {{- '<|im_end|>' }} | |
| {%- elif message.role == "user" %} | |
| {{- '<|im_start|>' + message.role + '\n' + render_content(message.content) + '<|im_end|>' }} | |
| {%- elif message.role == "system" %} | |
| {{- '<|im_start|>' + message.role + '\n' + render_content(message.content) + '<|im_end|>' }} | |
| {%- elif message.role == "tool" %} | |
| {%- if loop.previtem and loop.previtem.role != "tool" %} | |
| {{- '<|im_start|>tool\n' }} | |
| {%- endif %} | |
| {{- '<tool_response>\n' }} | |
| {{- render_content(message.content) }} | |
| {{- '\n</tool_response>\n' }} | |
| {%- if not loop.last and loop.nextitem.role != "tool" %} | |
| {{- '<|im_end|>' }} | |
| {%- elif loop.last %} | |
| {{- '<|im_end|>' }} | |
| {%- endif %} | |
| {%- else %} | |
| {{- '<|im_start|>' + message.role + '\n' + render_content(message.content) + '<|im_end|>' }} | |
| {%- endif %} | |
| {%- endfor %} | |
| {%- if add_generation_prompt %} | |
| {{- '<|im_start|>assistant\n' }} | |
| {%- if not enable_thinking -%} | |
| {{- '<think></think>' -}} | |
| {%- else -%} | |
| {{- '' -}} | |
| {%- endif -%} | |
| {%- endif %} | |