Tuning collective communication for Partitioned Global Address Space programming models Rajesh Nishtala, Yili Zheng, Paul Hargrove, Katherine A. Yelick
Parallel Computing 37(9): 576-591 (2011).
PDF
Hybrid PGAS Runtime Support for Multicore Nodes Filip Blagojevic, Paul Hargrove, Costin Iancu, and Katherine Yelick
Fourth Conference on Partitioned Global Address Space Programming Model (PGAS10), Oct 2010.
PDF
Oversubscription on multicore processors C. Iancu, S. Hofmeyr, F. Blagojevic, and Yili Zheng
24rd International Parallel & Distributed Processing Symposium (IPDPS), 2010.
PDF
Porting GASNet to Portals: Partitioned Global Address Space (PGAS) Language Support for the Cray XT
D. Bonachea, P. Hargrove, M. Welcome, K. Yelick.
Cray User Group (CUG), 2009.
Paper PDFSlides PDF
Scaling Communication-Intensive Applications on BlueGene/P
Using One-Sided Communication and Overlap
R. Nishtala, P. Hargrove, D. Bonachea, K. Yelick.
23rd International Parallel & Distributed Processing Symposium (IPDPS), 2009.
PDF
Optimizing Collective Communication on Multicores R. Nishtala and K. Yelick.
HotPar 2009, Berkeley, CA, USA, March 2009
PDF
Pthreads or Processes: Which is Better for Implementing Global Address Space Languages? Jason Duell
Masters Report, August 2007.
Computer Science Division, UC Berkeley
PDF
Optimizing Bandwidth Limited Problems Using One-Sided Communication and Overlap C. Bell, D. Bonachea, R. Nishtala, K. Yelick.
Lawrence Berkeley National Lab Tech Report LBNL-59207: PDF available.
20th International Parallel & Distributed Processing Symposium (IPDPS), 2006: PDF of the conference paper is available,
as well as PowerPoint slides (PPT)(PDF) from the presentation.
Evaluating Support for Global Address Space Languages on the Cray X1 C. Bell, W. Chen, D. Bonachea, K. Yelick.
ICS 2004.
PDF available.
An Evaluation of Current High-Performance Networks C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove, P. Husbands, C. Iancu, M. Welcome, K. Yelick.
17th International Parallel and Distributed Processing Symposium (IPDPS), 2003.
Abstract,
Postscript,
PDF of the paper are available, as well as PowerPoint slides from the presentation.
A New DMA Registration Strategy for Pinning-Based High Performance Networks C. Bell and D. Bonachea. Workshop on Communication Architecture for Clusters (CAC'03), 2003.
PDF of the paper available, as well as
PowerPoint slides from the presentation.
GASNet Specification, v1.1 D. Bonachea. U.C. Berkeley Tech Report CSD-02-1207.
PDF available.
For the most up-to-date version of the spec, see the GASNet web page.
Problems with using MPI 1.1 and 2.0 as compilation targets
for parallel language implementations
Dan Bonachea and Jason Duell.
2nd Workshop on Hardware/Software Support for High Performance Scientific and Engineering Computing, SHPSEC-PACT03
Postscript and
PDF of the paper available,
as well as PowerPoint slides from the presentation.
Journal version:
Int. J. High Performance Computing and Networking, Vol. 1, Nos. 1/2/3, pp.91-99, 2004. PDF available,
Automatic Nonblocking Communication for Partitioned Global Address Space Programs W. Chen, D. Bonachea, C. Iancu, K. Yelick.
International Conference on Supercomputing (ICS), 2007.
PDF
Communication Optimizations for Fine-grained UPC Applications W. Chen, C. Iancu, K. Yelick.
14th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2005.
Lawrence Berkeley National Lab Tech Report LBNL-58382
PDF
HUNTing the Overlap C. Iancu, P. Husbands, P. Hargrove.
14th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2005.
Lawrence Berkeley National Lab Tech Report LBNL-58381
PDF
Polynomial-time Algorithms for Enforcing Sequential Consistency in
SPMD Programs with Arrays W. Chen, A. Krishnamurthy, K. Yelick.
16th International Workshop on Languages and Compilers for Parallel
Computing (LCPC), 2003.
PDF
Building a Source-to-Source UPC-to-C Translator W. Chen
Masters Report, 2005.
PDF
Multithreaded Global Address Space Communication Techniques for Gyrokinetic Fusion Applications on Ultra-Scale Platforms Robert Preissl, Nathan Wichmann, Bill Long, John Shalf, Stephane Ethier, and Alice Koniges
Proceedings of Supercomputing 2011.
PDF
Optimizing UPC programs for multi-core systems Yili Zheng
Scientific Programming 18(3-4): 183-191 (2011).
PDF from publisher
A programming model performance study using the NAS parallel benchmarks Hongzhang Shan, Filip Blagojevic, Seung-Jai Min, Paul Hargrove, Haoqiang Jin, Karl Fuerlinger, Alice Koniges and Nicholas J. Wright
Scientific Programming 18(3-4): 153-167 (2011).
PDF from publisher
GASP! A Standardized Performance Analysis Tool Interface for Global Address Space Programming Models Hung-Hsun Su, Dan Bonachea, Adam Leko, Hans Sherburne, Max Billingsley III, Alan D. George
Lawrence Berkeley National Lab Tech Report LBNL-61659, 2006.
PDF
GASP: A Performance Analysis Tool Interface for Global Address Space Programming Models, Specification Version 1.5 Adam Leko, Dan Bonachea, Hung-Hsun Su, and Alan D. George
Lawrence Berkeley National Lab Tech Report LBNL-61606, Sep 14 2006.
PDF
Message Strip Mining Heuristics for High Speed Networks C. Iancu, P. Husbands, W. Chen.
VECPAR 2004.
Postscript
A Performance Analysis of the Berkeley UPC Compiler W. Chen, D. Bonachea, J. Duell, P. Husbands, C. Iancu, K. Yelick.
17th Annual International Conference on Supercomputing (ICS), 2003.
Postscript, PDF, and the presentation slides available.
Proposal for Extending the UPC Memory Copy Library Functions, v2.0 (March 22, 2007)
D. Bonachea
UPC Community Forum, 2007.
PDF available.
Same document with additional sections describing supporting GASNet extensions: PDF available.
The archival version for citation in publication is:
Proposal for Extending the UPC Memory Copy Library Functions and Supporting Extensions to GASNet, v2.0 (March 22, 2007)
Lawrence Berkeley National Lab Tech Report LBNL-56495 v2.0
PDF available.
A Proposal for a UPC Memory Consistency Model, v1.1 (Aug 30, 2004) K. Yelick, D. Bonachea and C. Wallace
PS and PDF available.
The archival version for citation in publication is:
A Proposal for a UPC Memory Consistency Model, v1.0 (May 5, 2004)
Lawrence Berkeley National Lab Tech Report LBNL-54983
PS and PDF available.
UPC-IO: A Parallel I/O API for UPC, v1.0 T. El-Ghazawi, F. Cantonnet, P. Saha, R. Thakur, R. Ross, D. Bonachea
UPC community forum, 2003.
PDF available.
Introduction to UPC and Language Specification W. Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks, and K. Warren. CCS-TR-99-157,
IDA Center for Computing Sciences, 1999.
PDF available.
For the most up-to-date version of the spec, see the UPC Language Community web page.
Optimized Collectives for PGAS Languages with One-Sided Communication, SuperComputing, Nov 2006 (SC06 Poster Session) Dan Bonachea, Rajesh Nishtala, Paul Hargrove, Mike Welcome, Kathy Yelick
PDF poster
Efficient Point-to-Point Synchronization in UPC Dan Bonachea, Rajesh Nishtala, Paul Hargrove, Katherine Yelick
2nd Conf. on Partitioned Global Address Space Programming Models
(PGAS06), October 2006
PDF slidesPPT slides
GASP: A Performance Tool Interface for Global Address Space Languages & Libraries, PARA, July 2006 Adam Leko, Dan Bonachea, Hung-Hsun Su
PDF slidesPPT slides
The Performance and Productivity Benefits of Global Address Space Languages, SuperComputing, Nov 2005 (SC05 Poster Session) Dan O Bonachea, Christian Bell, Rajesh Nishtala, Kaushik Datta, Parry Husbands, Paul Hargrove, Katherine Yelick
PDF poster