We release the following software systems for academic purposes, under the
GNU/GPL license. For the latest released software, please refer to the
DeltaINT: Toward General In-band Network Telemetry with Extremely Low
A bandwidth-efficient in-band network telemetry prototype that supports
general network management applications.
StreamLEC: Enabling Low-Redundancy Proactive Fault Tolerance for Stream
Machine Learning via Erasure Coding
A distributed stream processing prototype that provides fault
tolerance for stream machine learning via erasure coding.
AIHS: Automated Intelligent Healing in Cloud-Scale Data Centers
A simulator for automated intelligent healing management.
SGXDedup: Accelerating Encrypted Deduplication via SGX
(USENIX ATC 2021)
An encrypted deduplication storage prototype that achieves high performance
with Intel SGX.
DiffKV: Differentiated Key-Value Storage Management for Balanced I/O
(USENIX ATC 2021)
An LSM-tree key-value store prototype that realizes differentiated key-value
RepairBoost: Boosting Full-Node Repair in Erasure-Coded Storage
(USENIX ATC 2021)
A prototype for improving the full-node repair performance in erasure-coded
StripeMerge: Efficient Wide-Stripe Generation for Large-Scale Erasure-Coded
A prototype that realizes efficient wide-stripe generation for large-scale
RackCU: Optimal Rack-Coordinated Updates in Erasure-Coded Data
A prototype for optimal rack-coordinated updates in erasure-coded data
ECWide: Exploiting Combined Locality for Wide-Stripe Erasure Coding in
A prototype that realizes combined locality for wide-stripe erasure
StreamDFP: Toward Adaptive Disk Failure Prediction via Stream Mining
A stream-based data mining prototype for disk failure prediction.
Enabling I/O-Efficient Redundancy Transitioning in Erasure-Coded KV Stores via
Elastic Reed-Solomon Codes.
An implementation of Elastic Reed-Solomon codes on Memcached for I/O-efficient
OmniMon: Re-architecting Network Telemetry with Resource Efficiency and Full
A new network telemetry architecture that aims for both resource efficiency
and full accuracy.
AustereCache: Austere Flash Caching with Deduplication and Compression
(USENIX ATC 2020)
A flash caching prototype with deduplication and compression.
On the Optimal Repair-Scaling Trade-off in Locally Repairable Codes
A storage prototype that realizes repair and scaling operations in locally
SpreadSketch: Toward Invertible and Network-Wide Detection of
A new invertible sketch design for superspreader detection. Both software and
P4 implementations are included.
TEDStore: Balancing Storage Efficiency and Data Confidentiality with Tunable
A tunable encrypted deduplication storage prototype.
UniKV: Toward High-Performance and Scalable KV Storage in Mixed Workloads
via Unified Indexing
A high-performance key-value store based on LevelDB for mixed workloads.
ECHash: Coupling Decentralized Key-Value Stores with Erasure Coding
An erasure-coded key-value store based on Memcached for efficient scaling.
FastPR: Fast Predictive Repair in Erasure-Coded Storage
A fast predictive repair prototype for erasure-coded storage on HDFS 3.1.1.
POCache: Parity-Only Caching for Robust Straggler Tolerance
A parity-only caching prototype built on HDFS.
Metadedup: Deduplicating Metadata in Encrypted Deduplication via
An encrypted deduplication prototype that realizes metadata deduplication.
MV-Sketch: A Fast and Compact Invertible Sketch for Heavy Flow Detection in
Network Data Streams
A new invertible sketch design with static memory allocation.
OpenEC: Toward Unified and Configurable Erasure Coding Management in
Distributed Storage Systems
A new framework for unified and configurable erasure coding management.
SketchLearn: Relieving User Burdens in Approximate Measurement with Automated
A prototype that realizes sketch-based measurement with automated statistical
inference. Released in August 2018.
(Awarded with badges)
CAU: Cross-rack-aware Updates for Erasure-Coded Data Centers
A prototype that realizes cross-rack-aware updates for erasure-coded data
centers. Released in August 2018.
HashKV: A High-performance KV Store for Update-intensive Workloads
(USENIX ATC 2018)
A high-performance KV store for update-intensive workloads based on hash-based
data grouping. Released in July 2018.
CellPAD: Detecting Performance Anomalies in Cellular Networks via Regression
Analysis (Networking 2018)
A KPI anomaly detection tool for cellular network management. Released in
DoubleR: Optimal Repair Layering for Erasure-Coded Data Centers
An optimal repair layering framework on erasure-coded HDFS that minimizes
cross-rack repair traffic. Released in October 2017.
SimEDC: Simulation Analysis of Reliability in Erasure-Coded Data
Centers (SRDS 2017)
A simulator that evaluates the storage reliability of erasure-coded data
centers via discrete-event simulations. Released in September 2017.
ECPipe: Repair Pipelining for Erasure-Coded Storage
(USENIX ATC 2017, TOS 2021)
A prototype that achieves fast repair for general erasure-coded storage.
Release in July 2017.
Information Leakage in Encrypted Deduplication via Frequency
Analysis (DSN 2017)
Attack and defense toolkits against a deduplication-based storage dataset via
frequency analysis. Release in June 2017.
MemEC: An Erasure-coding-based Distributed In-Memory KV store
An erasure-coding-based distributed in-memory KV store optimized for small KV
objects. Release in May 2017.
AF-Stream: A High-Performance Distributed Stream Processing System based on
Approximate Fault Tolerance
A High-Performance Distributed Stream Processing System based on Approximate
Fault Tolerance. Release in November 2016.
REED: Rekeying for Encrypted Deduplication Storage
A rekeying-aware encrypted deduplication storage system. Released in June
EPLog: Elastic Parity Logging for SSD RAID Arrays
A user-level software layer that achieves high reliability, endurance, and
performance for SSD RAID. Released in June 2016.
CDStore: Toward Reliable, Secure, and Cost-Efficient Cloud Storage via
(USENIX ATC 2015)
A multi-cloud storage system that unifies reliability (fault tolerance),
security, and deduplication. Released in May 2015.
Encoding-Aware Replication in Clustered File Systems
A new replication scheme for clustered file systems. Both the simulator and
Hadoop implementations are available. Released in May 2015.
EDP: Even Data Placement in Distributed Reliable Deduplication Storage
Systems (IWQoS 2015)
A distributed reliable deduplication system prototype that realizes even data
placement. Released in May 2015.
FastDR: Boosting Degraded Reads in Heterogeneous Erasure-Coded Storage
A fast degraded-read system for erasure-coded HDFS (based on HDFS-RAID).
Released in October 2014.
Degraded-First Scheduling: An Efficient MapReduce Task Scheduler for
erasure-coded HDFS (DSN 2014)
An efficient MapReduce task scheduler that enables efficient MapReduce
execution on erasure-coded HDFS under failure mode. Released in April
CodFS: An Erasure-Coded Clustered Storage System for Efficient Updates and
Recovery (FAST 2014)
An erasure-coded clustered storage system prototype that supports efficient
recovery and updates through an idea called parity logging with reserved space.
Released in January 2014.
STAIR Codes: A General Family of Erasure Codes for Tolerating Device and
Sector Failures (FAST 2014)
A C library of STAIR codes, which provide general construction of erasure
codes for simultanesouly tolerating device failures and sector errors in a
space-efficient manner. Released in January 2014.
RevDedup: Efficient Hybrid Inline and Out-of-line Deduplication for Backup
Storage (APSYS 2013, TOS 2014)
A prototype that achieves high read throughput for latest backups in
deduplication storage, while maintaining high write throughput and high
deduplication efficiency. Released in July 2013.
CORE: Regenerating-coding-based recovery for single and concurrent
failures (MSST 2013)
A prototype for enabling regenerating-coding-based recovery for single and
concurrent failures. It builds on HDFS-RAID. Released in April 2013.
Cloud-to-Device-Messaging (C2DM) Botnet (ACSAC 2012)
The C2DM botnet is a proof-of-concept botnet prototype that exploits Google's
Cloud to Device Messaging (C2DM) service as its C&C channel.
Released in September 2012.
FMSR-DIP: Functional Minimum Storage Regenerating Code with Data Integrity
Protection (SRDS 2012, TPDS 2014)
A prototype for enabling data integrity protection in regenerating-coded cloud
storage. Released in July 2012.
ADAM: An Automatic and Extensible Platform to Stress Test Android
Implementation of assessing the robustness of Android Anti-Virus Systems by
generating various types of malware variants. Released in April 2012.
CHR: A C Library for Cost-Based Heterogeneous Recovery for RAID-6
codes (DSN 2012)
A C library API for fast and effective cost-based heterogeneous recovery for
RDP and EVENODD codes. Released in April 2012.
Zpacr: A C Library in Searching for the Optimal Single-Disk Failure Recovery
Solution for XOR-based Erasure Codes (MSST 2012)
A C library API for single-disk failure recovery in XOR-Coded storage
systems. Released in March 2012.
NCCloud: Network-Coding-Based File System for Cloud Storage
(FAST 2012, INFOCOM 2013, TC 2014)
A cloud storage system that realizes minimum-storage regenerating codes for
multiple-cloud storage. Released in January 2012.
CloudVS: A Cloud-based Version Control System
A virtual machine version control system for Eucalyptus-based open-source
cloud platforms. Released in January 2012.
LiveDFS: Live Deduplication File System
A Linux kernel-space file system that supports live deduplication. One
application is for the virtual machine image storage. Released in
DeRef: A Privacy-Perserving Mechanism Against Request Forgery Attacks
Implementation of a web-based mechanism against request forgery.
Released in October 2011.
NCFS: Network-Coding-Based Distributed File System
An extensible platform for realizing theories of network coding in practical
distributed storage systems. Researchers can extend NCFS to experiment
new storage schemes based on erasure codes and regenerating codes. Released in
LVRM: Load-aware Virtual Router Monitor
(ICPP Workshop 2011)
Implementation of a user-space, load-aware virtual router monitor.
Released in February 2011.
FADE: Secure Overlay Cloud Storage with File Assured Deletion
(SecureComm 2010, TDSC 2012)
Implementation of a secure overlay cloud storage system that supports file
assured deletion. Released in September 2010.
Stable Opportunistic Routing (SOR)
Nsclick implementation of Stable Opportunistic Routing. Released in February
communicAtion Library (SEAL) (JSS 2007)
C language API which provides necessary software components for developers to
write secure dynamic group-oriented applications without any centralized key
server. Released in 2003.