CSE 597E: High Performance Computing on a Network of Workstation (Spring 1997)
Instructor: Anand Sivasubramaniam (anand@cse)
Office: 316 Pond Lab
Course Overview:
Parallel computer architecture has been steadily evolving over the
past decade. This course will cover state-of-the-art issues in this area,
specifically addressing operating systems and architectural support for
emerging cost-effective platforms such as Networks of Workstations (NOW).
We will cover network support, efficient user-level communication software,
latency reduction and tolerance, global resource management, file systems, I/O,
programming support, and applications issues for these platforms via
student presentation of recent literature.
Students should also submit a class project at the end.
The projects may involve literature survey, programming on a NOW connected
by ATM and Myrinet, and performance evaluation. Grading will be based on
class participation, the presentations and the project.
List of Papers:
Overview:
-
T. Anderson et al.,
A Case for Networks of Workstations,
IEEE Micro, Pages 54-64, February 1995.
Case Studies of Parallel Architectures:
-
C. E. Leiserson et al.,
The Network Architecture of the Connection Machine CM-5,
Proceedings of the ACM Symposium on Parallel Algorithms and Architectures,
1992.
-
A. Krishnamurthy et al.
Evaluation of Architectural Support for Global Address-based Communication in
Large Scale Parallel Machines,
Proceedings of ASPLOS, Pages 37-48, October 1996.
Network Interfaces and Networking:
-
M. A. Blumrich, C. Dubnicki, E. W. Felten, K. Li and M. R. Mesarina,
Two Virtual Memory Mapped Network Interface Designs,
Hot Interconnects Symposium, pp. 134-142, August 1994.
-
M. A. Blumrich, K. Li, R. Alpert, C. Dubnicki, E. W. Felten and J. Sandberg,
Virtual Memory Mapped Network Interface for the SHRIMP Multicomputer,
Proceedings of the International Symposium on Computer Architecture,
pp. 142-153, April 1994.
-
N. J. Boden et al.,
Myrinet: A Gigabit-per-second Local Area Network,
IEEE Micro, Pages 29-36, February 1995.
-
T. Chaney, A Fingerhut, M. Flucke, J. Turner,
Design of a Gigabit ATM Switch,
Technical Report WUCS-96-07,
Washington University, St. Louis,
February 1996.
-
ForeRunner ATM Switch Architecture,
Version 1.0, Fore Systems, Warrendale PA,
1996.
-
ATM Service Categories: The Benefits to the User,
ATM Forum White Paper, 1996.
-
Z. Dittia, G. Parulkar; and J. R. Cox,
The APIC Approach to High Performance Network Interface Design: Protected
DMA and Other Techniques,
Proceedings of IEEE Infocom 1997.
Communication Software:
-
S. Pakin, Lauria, A. Chien,
High Performance Messaging on Workstations: Illinois Fast Messages (FM) For
Myrinet,
Proc. of Supercomputing '95.
-
Lauria, A. Chien,
MPI-FM : High Performance MPI on Workstation Clusters
Journal of Parallel and Distributed Computing, February 1997.
-
T. von Eicken, A. Basu, V. Buch, and W. Vogels,
U-Net: A User-Level Networking Interface for Parallel and Distributed
Computing,
Proc. of the 15th ACM Symposium on Operating Systems Principles,
Pages 1-14, December 1995.
-
A. Basu, M. Welsh, T. von Eicken,
Incorporating Memory Management into User-Level Networking Interface,
Technical Report, Cornell University, 1996.
Scheduling:
-
R. H. Arpaci et al.,
The Interaction of Parallel and Sequential Workloads on a Network of
Workstations,
Proceedings of ACM Sigmetrics,
May 1995.
-
A. C. Dusseau, R. H. Arpaci, D. E. Culler,
Effective Distributed Scheduling of Parallel Workloads,
Proceedings of ACM Sigmetrics,
May 1996.
-
A. Acharya, G. Edjlali and J. Saltz,
The Utility of Exploiting Idle Workstations for Parallel Computation,
Proceedings of ACM Sigmetrics, June 1997.
Global Memory Management:
-
E. W. Felten and J. Zahorjan,
Issues in the Implementation of a Remote Memory Paging System,
Technical Report, Univ of Washington,
February 1991.
-
M. J. Feeley, W. E. Morgan, F. H. Pighin, A. R. Karlin, H. M. Levy,
Implementing Global Memory Management in a Workstation Cluster,
Proceeding of the 15th ACM Symposium on Operating Systems Principles,
December 1995.
-
H. A. Jamrozik, M. J. Feeley, G. M. Voelker, J. Evans II, A. R. Karlin,
H. M. Levy, M. K. Vernon,
Reducing Network Latency Using Subpages in a Global Memory Environment,
Proceedings of the Seventh ACM Conference on Architectural Support for
Programming Languages and Operating Systems,
October 1996.
File Systems:
Input-Output:
-
E.K. Lee and C. A. Thekkath,
Petal : Distributed Virtual Disks,
Proceedings of the Seventh ACM Conference on Architectural Support for
Programming Languages and Operating Systems,
October 1996.
-
S. Asami, N. Talagala, T. Anderson, K. Lutz, and D. Patterson,
The Design of Large-Scale, Do-It-Yourself RAIDs,
Draft 1.0,
November 1995.
Applications:
-
A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, D. E. Culler, J. M. Hellerstein,
and D. A. Patterson,
High-Performance Sorting on Networks of Workstations,
Proceedings of ACM SIGMOD, 1997.
-
B. L. Tierney and W. E.Johnston,
The Image Server System: A High-Speed Parallel Distributed Data Server,
Lawrence Berkeley Lab Report 36002.
-
D Pegler, N Yeadon, D Hutchison and D Shepherd,
Incorporating Scalability into Networked Multimedia Storage,
Technical Report, Dept of Computing, Lancaster University.
Back to Anand Sivasubramaniam's Home Page