MIT cuts on‑device training energy 90%
- MIT researchers on April 29 unveiled FTTE, a federated learning method for edge devices that speeds privacy-preserving training on local data. - In tests, FTTE converged about 81 percent faster, cut on-device memory use 80 percent, and reduced communication payloads 69 percent. - The work targets sensors, wearables, and other constrained devices in health care and finance. (news.mit.edu)
Training a model usually means shipping data to a server farm, but MIT says a new method can keep that work on everyday devices. (news.mit.edu) The method is called FTTE, short for Federated Tiny Training Engine, and MIT described it on April 29, 2026. Lead author Irene Tenison said it was built for resource-constrained edge devices such as sensors and smartwatches. (news.mit.edu) (arxiv.org) The basic setup is federated learning: a shared model is sent to many devices, each device trains on its own local data, and only updates go back. Raw data stays on the device, which is why the approach is used for privacy-sensitive settings. (news.mit.edu) (arxiv.org) That system breaks down when devices are uneven. Some clients have little memory, weak processors, spotty connections, or arrive late, and those “stragglers” can slow the whole training run. (news.mit.edu) (arxiv.org) FTTE tries to fix that by sending sparse parameter updates instead of full ones. In plain terms, devices transmit only the most useful slices of a model update, which cuts both memory pressure and network traffic. (arxiv.org) It also uses semi-asynchronous training, which means the server does not wait for every device before moving forward. The paper says FTTE weights incoming updates by both their age and their variance, so stale updates count less. (arxiv.org) In experiments across up to 500 clients with as many as 90 percent stragglers, the team reported 81 percent faster convergence than synchronous federated learning baselines such as FedAvg. The same tests showed 80 percent lower on-device memory use and 69 percent smaller communication payloads. (arxiv.org) (news.mit.edu) MIT said the target applications include health care and finance, where data privacy rules are strict and hardware is often limited. The co-authors are Anna Murphy, Charles Beauville, and Lalana Kagal, and the team said the work will be presented at the IEEE International Joint Conference on Neural Networks. (news.mit.edu) The result is narrower than “on-device training energy down 90 percent.” MIT’s published numbers are about 81 percent faster convergence, 80 percent lower memory use, and 69 percent less communication, all in federated learning tests on constrained devices. (news.mit.edu) (arxiv.org) MIT’s claim is that smaller, slower devices no longer have to sit out privacy-preserving training. FTTE is meant to let more of them participate without sending their raw data back to the cloud. (news.mit.edu)