KAIST builds on‑device AI chip

KAIST researchers unveiled an on‑device AI chip that learns user habits to enable hyper‑personalization in real time without cloud dependency — a development that could power highly adaptive assistive technologies for learning. The chip foreshadows a wave of privacy‑friendly, personalized accessibility features in edtech devices. (koreaherald.com) (en.sedaily.com)

KAIST’s team led by Professor Hoi‑Jun Yoo revealed the chip named “SoulMate” in a March 17, 2026 announcement. (koreaherald.com) SoulMate is described in the ISSCC paper as a 28nm CMOS system‑on‑chip that combines retrieval‑augmented generation (RAG) and Low‑Rank Adaptation (LoRA) to personalize a compact LLaMA3.2‑1B model with ~1,000 context tokens. (ssl.kaist.ac.kr) The semiconductor demonstrates a measured operating power envelope of 9.8–180.5 mW and reports energy efficiencies of 26.3 μJ/token for inference and 56.8 μJ/token for fine‑tuning. (ssl.kaist.ac.kr) KAIST’s implementation claims a user‑interaction latency of about 216.4 ms and cites typical cloud time‑to‑first‑token (TTFT) delays that can exceed 400 ms as a baseline comparison. (ssl.kaist.ac.kr) The system stores dialogue history in a 32 MB on‑device database and uses a 4 MB off‑chip replay buffer to collect user feedback for LoRA‑based updates. (ssl.kaist.ac.kr) KAIST reports that, prior to their optimizations, redundant model updates accounted for roughly 73% of total system energy consumption, a key engineering challenge the team addressed. (ssl.kaist.ac.kr) The research was presented at the International Solid‑State Circuits Conference (ISSCC) 2026 in San Francisco and was selected as a highlight paper where the team demonstrated a working chip that adjusted response style in real time. (koreaherald.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.