Federated Learning Gets an Upgrade

A new study demonstrates a way to mitigate "catastrophic forgetting" in federated learning setups using graph neural networks. This technique allows for continual model updates across distributed systems without pooling sensitive data, a key challenge in financial risk modeling.

Catastrophic forgetting is a fundamental challenge in machine learning where a neural network, upon learning a new task, overwrites and forgets previously learned information. This phenomenon, first identified in 1989, is a major obstacle to creating AI systems that can learn sequentially and continuously adapt without constant, expensive retraining on their entire dataset. In federated learning, this problem is amplified. When a central model is updated with learnings from multiple local devices—each with its own unique data distribution—the model can overfit to the most recent local data, degrading its performance on knowledge learned from other sources. This issue, known as data heterogeneity, is a primary cause of performance degradation in federated systems. For the financial sector, this poses a significant risk. Banks and financial institutions use federated learning to collaboratively build robust models for fraud detection and risk assessment without sharing private customer data. If the shared model "forgets" fraud patterns seen by one bank while learning new ones from another, its overall effectiveness is compromised. Graph neural networks (GNNs) are particularly effective in finance because they can model the complex, network-like relationships between transactions, accounts, and entities to uncover sophisticated fraud rings. The challenge is to continually update these complex graph models in a federated system without losing critical, previously-learned patterns. Several techniques exist to combat catastrophic forgetting, including regularization methods like Elastic Weight Consolidation (EWC), which selectively slows down learning on weights crucial for previous tasks. Another approach involves "replay" techniques, where a small subset of past data is stored and periodically shown to the model during new training cycles. The development of novel techniques using GNNs represents a significant step forward. By preserving historical information within the graph structure itself, these methods aim to create more stable and continuously learning models, a critical requirement for deploying reliable AI in dynamic, high-stakes environments like financial risk management.

Federated Learning Gets an Upgrade

Get your own daily briefing