GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network Dan Bonachea and Paul H. Hargrove
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM), November 2018.
https://doi.org/10.1109/PAW-ATM.2018.00008 PDF,
Slides PDF
GASNet-EX: A High-Performance, Portable Communication Library for Exascale Dan Bonachea and Paul H. Hargrove
Proceedings of Languages and Compilers for Parallel Computing (LCPC'18), Oct 2018.
https://doi.org/10.25344/S4QP4W PDF,
Slides PDF
Congestion Avoidance on Manycore High Performance Computing Systems Miao Luo, Dhabaleswar K. Panda, Khaled Z. Ibrahim, and Costin Iancu
26th ACM International Conference on Supercomputing (ICS), 2012.
PDF available from the publisher.
Tuning collective communication for Partitioned Global Address Space programming models Rajesh Nishtala, Yili Zheng, Paul Hargrove, Katherine A. Yelick
Parallel Computing 37(9): 576-591 (2011).
PDF
Hybrid PGAS Runtime Support for Multicore Nodes Filip Blagojevic, Paul Hargrove, Costin Iancu, and Katherine Yelick
Fourth Conference on Partitioned Global Address Space Programming Model (PGAS10), Oct 2010.
PDF
Oversubscription on multicore processors C. Iancu, S. Hofmeyr, F. Blagojevic, and Yili Zheng
24rd International Parallel & Distributed Processing Symposium (IPDPS), 2010.
PDF
Porting GASNet to Portals: Partitioned Global Address Space (PGAS) Language Support for the Cray XT
D. Bonachea, P. Hargrove, M. Welcome, K. Yelick.
Cray User Group (CUG), 2009.
https://doi.org/10.25344/S4RP46 Paper PDFSlides PDF
Scaling Communication-Intensive Applications on BlueGene/P
Using One-Sided Communication and Overlap
R. Nishtala, P. Hargrove, D. Bonachea, K. Yelick.
23rd International Parallel & Distributed Processing Symposium (IPDPS), 2009.
https://doi.org/10.1109/IPDPS.2009.5161076 PDF
Optimizing Collective Communication on Multicores R. Nishtala and K. Yelick.
HotPar 2009, Berkeley, CA, USA, March 2009
PDF
Pthreads or Processes: Which is Better for Implementing Global Address Space Languages? Jason Duell
Masters Report, August 2007.
Computer Science Division, UC Berkeley
PDF
Evaluating Support for Global Address Space Languages on the Cray X1 C. Bell, W. Chen, D. Bonachea, K. Yelick.
International Conference on Supercomputing (ICS) 2004.
https://doi.org/10.1145/1006209.1006236 PDF available.
An Evaluation of Current High-Performance Networks C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove, P. Husbands, C. Iancu, M. Welcome, K. Yelick.
17th International Parallel and Distributed Processing Symposium (IPDPS), 2003.
https://doi.org/10.1109/IPDPS.2003.1213106 Abstract,
Postscript,
PDF of the paper are available, as well as PowerPoint slides from the presentation.
A New DMA Registration Strategy for Pinning-Based High Performance Networks C. Bell and D. Bonachea.
International Parallel and Distributed Processing Symposium (IPDPS), 2003.
https://doi.org/10.1109/IPDPS.2003.1213363 PDF of the paper available, as well as
PowerPoint slides from the presentation.
Problems with using MPI 1.1 and 2.0 as compilation targets
for parallel language implementations
Dan Bonachea and Jason Duell.
2nd Workshop on Hardware/Software Support for High Performance Scientific and Engineering Computing, SHPSEC-PACT03
https://doi.org/10.25344/S4JP4B PDF of the paper available,
as well as slides from the presentation.
Journal version:
Int. J. High Performance Computing and Networking, Vol. 1, Nos. 1/2/3, pp.91-99, 2004.
https://escholarship.org/uc/item/7v6644bs PDF available.
Compiler Optimizations and Analysis
Efficient Data Race Detection for Distributed Memory Parallel Programs C.-S. Park, K. Sen, P. Hargrove, C. Iancu
Proceedings of Supercomputing 2011.
PDF
Automatic Nonblocking Communication for Partitioned Global Address Space Programs W. Chen, D. Bonachea, C. Iancu, K. Yelick.
International Conference on Supercomputing (ICS), 2007.
https://escholarship.org/uc/item/9h2915st PDF
Communication Optimizations for Fine-grained UPC Applications W. Chen, C. Iancu, K. Yelick.
14th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2005.
Lawrence Berkeley National Lab Tech Report LBNL-58382
https://escholarship.org/uc/item/0j25j80n PDF
HUNTing the Overlap C. Iancu, P. Husbands, P. Hargrove.
14th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2005.
Lawrence Berkeley National Lab Tech Report LBNL-58381
https://escholarship.org/uc/item/1n02118z PDF
Polynomial-time Algorithms for Enforcing Sequential Consistency in
SPMD Programs with Arrays W. Chen, A. Krishnamurthy, K. Yelick.
16th International Workshop on Languages and Compilers for Parallel
Computing (LCPC), 2003.
PDF
Building a Source-to-Source UPC-to-C Translator W. Chen
Masters Report, 2005.
PDF
Applications
Communication Avoiding and Overlapping for Numerical Linear Algebra Evangelos Georganas, Jorge Gonzáez-Domínguez, Edgar Solomonik, Yili Zheng, Juan Touriño and Katherine Yelick
Proceedings of Supercomputing 2012.
Multithreaded Global Address Space Communication Techniques for Gyrokinetic Fusion Applications on Ultra-Scale Platforms Robert Preissl, Nathan Wichmann, Bill Long, John Shalf, Stephane Ethier, and Alice Koniges
Proceedings of Supercomputing 2011.
PDF
Multi-Threading and One-Sided Communication in Parallel LU Factorization Parry Husbands, Katherine Yelick
Supercomputing 2007.
PDF
Performance Analysis
A preliminary evaluation of the hardware acceleration of the Cray Gemini Interconnect for PGAS languages and comparison with MPI
Hongzhang Shan, Nicholas Wright, John Shalf, Katherine Yelick, Marcus Wagner and Nathan Wichmann
Second International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS11), ACM Sigmetrics, 2011.
PDF
Optimizing UPC programs for multi-core systems Yili Zheng
Scientific Programming 18(3-4): 183-191 (2011).
doi:10.3233/SPR-2010-0310
A programming model performance study using the NAS parallel benchmarks Hongzhang Shan, Filip Blagojevic, Seung-Jai Min, Paul Hargrove, Haoqiang Jin, Karl Fuerlinger, Alice Koniges and Nicholas J. Wright
Scientific Programming 18(3-4): 153-167 (2011).
doi:10.3233/spr-2010-0306 PDF on OSTI
GASP! A Standardized Performance Analysis Tool Interface for Global Address Space Programming Models Hung-Hsun Su, Dan Bonachea, Adam Leko, Hans Sherburne, Max Billingsley III, Alan D. George
Lecture Notes in Computer Science, 4699 LNCS, 450-459, 2006.
https://doi.org/10.1007/978-3-540-75755-9_54 PDF
GASP: A Performance Analysis Tool Interface for Global Address Space Programming Models, Specification Version 1.5 Adam Leko, Dan Bonachea, Hung-Hsun Su, and Alan D. George
Lawrence Berkeley National Lab Tech Report LBNL-61606, Sep 14 2006.
https://doi.org/10.2172/918569 PDF
Message Strip Mining Heuristics for High Speed Networks C. Iancu, P. Husbands, W. Chen.
VECPAR 2004.
Postscript
A Performance Analysis of the Berkeley UPC Compiler W. Chen, D. Bonachea, J. Duell, P. Husbands, C. Iancu, K. Yelick.
17th Annual International Conference on Supercomputing (ICS), 2003.
https://doi.org/10.1145/782814.782825 Postscript, PDF, and the presentation slides available.
Proposal for Extending the UPC Memory Copy Library Functions, v2.0 (March 22, 2007)
D. Bonachea
UPC Community Forum, 2007.
PDF available.
Same document with additional sections describing supporting GASNet extensions: PDF available.
The archival version for citation in publication is:
Proposal for Extending the UPC Memory Copy Library Functions and Supporting Extensions to GASNet, v2.0 (March 22, 2007)
Lawrence Berkeley National Lab Tech Report LBNL-56495 v2.0
https://doi.org/10.2172/920052 PDF available.
A Proposal for a UPC Memory Consistency Model, v1.1 (Aug 30, 2004) K. Yelick, D. Bonachea and C. Wallace.
PS and PDF available.
The archival version for citation in publication is:
A Proposal for a UPC Memory Consistency Model, v1.0 (May 5, 2004)
Lawrence Berkeley National Lab Tech Report LBNL-54983
https://doi.org/10.2172/823757 PS and PDF available.
UPC-IO: A Parallel I/O API for UPC, v1.0 T. El-Ghazawi, F. Cantonnet, P. Saha, R. Thakur, R. Ross, D. Bonachea
UPC community forum, 2003.
PDF available.
Introduction to UPC and Language Specification W. Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks, and K. Warren. CCS-TR-99-157,
IDA Center for Computing Sciences, 1999.
PDF available.
For the most up-to-date version of the spec, see Documentation.
Other (non-UPC) PGAS Research
A Local-View Array Library for Partitioned Global Address Space C++ Programs Amir Kamil, Yili Zheng, and Katherine Yelick
ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers for Array Programming, 2014.
PDF available.
UPC++: A PGAS Extension for C++ Yili Zheng, Amir Kamil, Michael Driscoll, Hongzhang Shan, and Katherine Yelick
28th IEEE International Parallel and Distributed Processing Symposium, 2014.
PDF available.
Hierarchical Computation in the SPMD Programming Model Amir Kamil and Katherine Yelick
26th International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2013.
PDF available.
Designing a Unified Programming Model for Heterogeneous Machines Michael Garland, Manjunath Kudlur and Yili Zheng
Proceedings of Supercomputing 2012.
PGAS for Distributed Numerical Python Targeting Multi-core Clusters Mads Kristensen, Yili Zheng and Brian Vinter
IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2012.
Publisher's page
Language Tutorial Materials
Programming in UPC (Unified Parallel C), CS267 Lecture, Spring 2006 Katherine Yelick, Tarek El-Ghazawi, Dan O Bonachea, Christian Bell, Wei Chen, Jason Duell, Paul Hargrove, Parry Husbands, Costin Iancu, Rajesh Nishtala, Mike Welcome
PDF slidesPPT slides
Optimized Collectives for PGAS Languages with One-Sided Communication, SuperComputing, Nov 2006 (SC06 Poster Session) Dan Bonachea, Rajesh Nishtala, Paul Hargrove, Mike Welcome, Kathy Yelick
PDF poster
Efficient Point-to-Point Synchronization in UPC Dan Bonachea, Rajesh Nishtala, Paul Hargrove, Katherine Yelick
2nd Conf. on Partitioned Global Address Space Programming Models
(PGAS06), October 2006
PDF slidesPPT slides
GASP: A Performance Tool Interface for Global Address Space Languages & Libraries, PARA, July 2006 Adam Leko, Dan Bonachea, Hung-Hsun Su
PDF slidesPPT slides
The Performance and Productivity Benefits of Global Address Space Languages, SuperComputing, Nov 2005 (SC05 Poster Session) Dan O Bonachea, Christian Bell, Rajesh Nishtala, Kaushik Datta, Parry Husbands, Paul Hargrove, Katherine Yelick
PDF poster