I’m happy to introduce “CRAFT”: a new federated learning optimizer that treats aggregation as a geometric correction problem instead of naive averaging. This is neat work with Ziqi from FAU and Qiang, building on previous version of our ConIFG optimizer.
CRAFT explicitly enforces positive alignment between the global update and every participating client update. This gives:
- higher mean accuracy
- substantially improved worst-client performance
- reduced client-level disparity under highly heterogeneous non-IID settings
Key ingredients are a closed-form conflict resolution via Moore–Penrose projections, momentum-like reference updates, and layer-wise conflict handling for deep neural networks.
You can find the full paper at: https://arxiv.org/abs/2605.21317
and Code on GitHub: https://github.com/tum-pbs/CRAFT
Original ConFIG: https://tum-pbs.github.io/ConFIG/
Overview: CRAFT introduces a new aggregation strategy for federated learning (FL) under heterogeneous, non-IID client data distributions. Instead of averaging client updates as in FedAvg, CRAFT formulates aggregation as a geometric correction problem: the global update is chosen to stay as close as possible to a reference direction while explicitly enforcing positive alignment with every participating client update.
The key idea is to resolve gradient conflicts directly at the aggregation stage. CRAFT derives a closed-form solution using Moore–Penrose projections, avoiding expensive iterative optimization while guaranteeing conflict-free alignment whenever feasible. The method further introduces a layer-wise formulation, allowing conflicts to be corrected independently across neural network layers, which is particularly important for deep models.
Unlike prior conflict-resolution approaches such as ConFIG, CRAFT uses the previous global update as a momentum-like geometric reference. This stabilizes optimization across communication rounds while still adapting to the current set of participating clients. The resulting update preserves useful historical information and only applies the minimum correction required to satisfy alignment constraints.
Experiments on FEMNIST and CIFAR-10/100 with CNNs and deep ResNets demonstrate substantial gains over existing FL baselines, including FedAvg, FedProx, FedNova, FedAdam, FedLF, FedFV, qFedAvg, AFL, and ConFIG. CRAFT consistently improves mean client accuracy while simultaneously reducing client-level disparity. For example, on CIFAR-10 with ResNet-20, CRAFT improves mean client accuracy from 0.525 (ConFIG) to 0.806 and significantly improves worst-client performance.
Beyond standard federated learning, we also show that CRAFT acts as a plug-in aggregation operator for personalized FL methods such as Ditto. Simply replacing averaging with CRAFT inside Ditto yields strong additional gains, especially in challenging heterogeneous settings.
Overall, the paper reframes federated aggregation from averaging to constrained geometric alignment, providing a lightweight and theoretically grounded mechanism for improving both performance and fairness in heterogeneous federated learning.
