Introducing MiniCPM-o 2.6: An 8B size, GPT-4o level Omni Model runs on device
Highlights:
~Match GPT-4o-202405 in vision, audio and multimodal live streaming
~End-to-end real-time bilingual audio conversation ~Voice cloning & emotion control
~Advanced OCR & video understanding
~Offline iPad-compatible multimodal live streaming