Perceptron Mk1 is a vision-language model that understands images and video. Ask it questions, detect objects, read text, get captions, or clip events — all through a simple API.Documentation Index
Fetch the complete documentation index at: https://docs.perceptron.inc/llms.txt
Use this file to discover all available pages before exploring further.
Try Perceptron Mk1 in 30 seconds
Create an API key
Get your key from the Perceptron platform
Join Discord
Get help and see what others are building
Get started with Image
Step through this example interactively
Get started with Video
Step through this example interactively
Using Python? Install with
pip install perceptron or pip install openaitemperature defaults to 0.0). See API Reference for all parameters.
Explore our developer guides
Image Q&A
Ask questions about images and get grounded answers
Video Q&A
Ask questions about video and get answers grounded in time
Object Detection
Locate targets with precise bounding boxes
Video Clipping
Find events in video and return start/end timestamps
OCR
Extract text from images and documents
Image Captioning
Generate descriptions of images
In-Context Learning (Image)
Adapt Perceptron Mk1 to image tasks with a handful of examples
In-Context Learning (Video)
Adapt Perceptron Mk1 to video tasks with a handful of examples
Models overview
| Model | Best for | Speed | Latest update |
|---|---|---|---|
perceptron-mk1 | Image & Video, reasoning enabled | Standard | 2026-05-12 |
isaac-0.2-2b-preview | Image, reasoning enabled | Fast | 2025-12-10 |
isaac-0.2-1b | Image, reasoning enabled, low-latency / edge deployment | Fastest | 2025-12-10 |
isaac-0.1 | Images (legacy support) | Fast | 2025-09-17 |
Model details
Model details
Perceptron Mk1
Best-in-class closed-source VLM with reasoning — accepts image and video inputs. (“Mk1” is short for “Mark 1”.)- Model ID:
perceptron-mk1 - Context: 32K tokens
- Reasoning: Yes
- Pricing: $0.15/M input, $1.50/M output
- Closed source
isaac-0.2-2b-preview
Best-in-class open-weights 2B VLM with reasoning. Sub-200ms time-to-first-token.- Model ID:
isaac-0.2-2b-preview - Context: 8K tokens
- Reasoning: Yes
- Pricing: $0.15/M input, $1.25/M output
- Open weights on Hugging Face
isaac-0.2-1b
Compact 1B VLM with reasoning, optimized for edge and low-latency deployments.- Model ID:
isaac-0.2-1b - Context: 8K tokens
- Reasoning: Yes
- Pricing: $0.15/M input, $1.25/M output
- Open weights on Hugging Face
isaac-0.1
Original 2B VLM, still supported for existing integrations.- Model ID:
isaac-0.1 - Context: 8K tokens
- Reasoning: No
- Pricing: $0.15/M input, $1.25/M output
- Open weights on Hugging Face
Benchmarks
Perceptron Mk1 benchmark results:


