Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
software [2019/07/03 14:12]
fablpd
software [2021/01/14 11:44]
guirguis
Line 1: Line 1:
-====== Software projects developed at LPD ======+====== Software projects developed at DCL ======
  
 DCL has a github page where most new software projects are published: [[https://​github.com/​LPD-EPFL/​]] DCL has a github page where most new software projects are published: [[https://​github.com/​LPD-EPFL/​]]
 +
 +===== AggregaThor =====
 +
 +Designed for the MLSYS '19 paper: "//​AggregaThor:​ Byzantine Machine Learning via Robust Gradient Aggregation//​."​
 +
 +AggregaThor is the first scalable Byzantine resilient framework for distributed machine learning applications. AggregaThor is built on top of TensorFlow while achieving transparency:​ applications built with TensorFlow do not need to change their interfaces to be made Byzantine-resilient. AggregaThor uses the parameter server architecture,​ and it adds (to vanilla TensorFlow) two main layers: (1) the aggregation layer and (2) the communication layer. The former uses a statistically-robust gradient aggregation rule, called Multi-Krum, to robustly aggregate workers'​ gradients, ensuring convergence of training even in the existence of malicious workers. The communication layer enables users to experiment with unreliable transport layer (i.e., using UDP), which achieves better performance than vanilla TensorFlow in highly-saturated networks. AggregaThor source code was evaluated by experts and was given ACM accreditations for being functional and reusable.
 +
 +[[https://​github.com/​LPD-EPFL/​AggregaThor|Code]]
  
 ===== MVTIL ===== ===== MVTIL =====