Debugging Distributed Systems

In this project, we aim to develop debugging techniques for distributed systems and sensor networks as such systems become more and more pervasive. We focus on post-mortem analyses for runtime failures.

Our contributions are highlighted as follows.

Funding

A Holistic Approach to Reliable Pervasive Systems, NSF-CSR-0834529, 2008-2011.

Students

Publications

DSN K. Lee, W. N. Sumner, X. Zhang and P. Eugster. Unified Debugging of Distributed Systems with Recon ,
the 41th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2011.

[abstract][pdf]
PLDI K. Lee, Y. Zheng, W. N. Sumner, X. Zhang. Toward Generating Reduciable Replay Log ,
ACM SIGPLAN Conference on Programming Language Design and Implementation, 2011.

[abstract][pdf]
SENSYS V. Sundaram, P. Eugster and X. Zhang. Efficient Diagnostic Tracing Support forWireless Sensor Networks ,
the 8th ACM Conference on Embedded Networked Sensor Systems, 2010.

[abstract][pdf]
SRDS B. Xin, P. Eugster, X. Zhang, and J. Yang . Lightweight Task Graph Inference for Distributed Applications ,
the 29th IEEE International Symposium on Reliable Distributed Systems, 2010.

[abstract][pdf]