Perceptron: Perceptron Mk1

Model Information
Slug perceptron-mk1
Release Date May 12, 2026
Organization Perceptron
Model Description
Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning. It accepts image and video inputs paired with natural-language queries and returns detailed visual-understanding responses, either as structured output or as natural language. It excels at video understanding tasks such as video QA, summarization, and event detection. On image inputs, it advances point-by-example grounding from multimodal prompts, OCR and document parsing on messy real-world inputs, open-vocabulary object detection and counting, and hand pose estimation.

Reasoning can be enabled per request to trade latency for deeper analysis on harder tasks. Structured annotations are emitted inline with text only when explicitly requested via the `annotation_format` parameter: pass `"point"`, `"box"`, or `"polygon"` for spatial localization on images, or `"clip"` for temporal segments (start/end timestamps) in video. Without `annotation_format`, the model returns natural-language text only.
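The parameter rules above can be sketched as a small request builder. This is a minimal illustration only: `annotation_format` and its four values come from the description, while the function name `build_request`, the `prompt`, `media`, and `reasoning` field names, and the strict image/video pairing check are assumptions made for the example and may not match any provider's actual request schema.

```python
# Values the description lists for `annotation_format`.
IMAGE_FORMATS = {"point", "box", "polygon"}  # spatial localization on images
VIDEO_FORMATS = {"clip"}                     # temporal segments in video


def build_request(prompt, media_url, media_kind,
                  annotation_format=None, reasoning=False):
    """Assemble a hypothetical request body as a plain dict.

    Every field name except `annotation_format` is an assumption.
    """
    if media_kind not in ("image", "video"):
        raise ValueError("media_kind must be 'image' or 'video'")

    body = {
        "model": "perceptron/perceptron-mk1",
        "prompt": prompt,
        "media": {"kind": media_kind, "url": media_url},  # assumed shape
        "reasoning": reasoning,                           # assumed flag name
    }

    # Omitting annotation_format means natural-language text only.
    if annotation_format is not None:
        allowed = IMAGE_FORMATS if media_kind == "image" else VIDEO_FORMATS
        if annotation_format not in allowed:
            raise ValueError(
                f"{annotation_format!r} is not listed for {media_kind} input"
            )
        body["annotation_format"] = annotation_format
    return body


# Example: ask for bounding boxes on an image, with reasoning enabled.
request = build_request(
    "Find every coffee mug.",
    "https://example.com/kitchen.jpg",
    "image",
    annotation_format="box",
    reasoning=True,
)
```

The pairing check follows the wording of the description (spatial formats for images, `"clip"` for video); a real endpoint may be more permissive, so treat the validation as a convenience rather than a statement of API behavior.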
Available at 5 Providers
| Provider | Type | Model Name | Original Model | Input ($/1M) | Output ($/1M) | Free |
| --- | --- | --- | --- | --- | --- | --- |
| Cline | Code | Perceptron: Perceptron Mk1 | perceptron/perceptron-mk1 | $0.15 | $1.50 | |
| Kilo Code | Code | Perceptron: Perceptron Mk1 | perceptron/perceptron-mk1 | $0.15 | $1.50 | |
| Nous Research | | Perceptron: Perceptron Mk1 | perceptron/perceptron-mk1 | $0.15 | $1.50 | |
| OpenRouter | Chat, Code | Perceptron Mk1 | perceptron/perceptron-mk1 | $0.15 | $1.50 | |
| Krater | | Perceptron: Perceptron Mk1 | perceptron-mk1 | - | - | |
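
As a rough worked example of the listed rates ($0.15 per 1M input tokens and $1.50 per 1M output tokens), the sketch below estimates the cost of a single request. The function name `estimate_cost_usd` and the token counts are hypothetical, and how a provider counts tokens for image or video input is not stated in this listing.

```python
# Listed rates, in US dollars per one million tokens.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 1.50


def estimate_cost_usd(input_tokens, output_tokens):
    """Return the estimated cost in USD for one request at the listed rates."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 10,000 output tokens.
# 200,000 * 0.15 / 1e6 = 0.03 and 10,000 * 1.50 / 1e6 = 0.015.
cost = estimate_cost_usd(200_000, 10_000)
print(f"${cost:.4f}")  # prints $0.0450
```

Because output tokens cost ten times as much as input tokens at these rates, long structured responses can dominate the bill even when the input is large.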