Publications

2017

  • Roger Kowalewski and Karl Fürlinger. Debugging Latent Synchronization Errors in MPI-3 One-Sided Communication. In Christoph Niethammer, José Gracia, Tobias Hilbrich, Andreas Knüpfer, Michael M. Resch, and Wolfgang E. Nagel, editors, Tools for High Performance Computing 2016: Proceedings of the 10th International Workshop on Parallel Tools for High Performance Computing, October 2016, Stuttgart, Germany, pages 83-96. Springer International Publishing. Cham, 2017. (»BibTeX, »Online)
  • D. Unat, A. Dubey, T. Hoefler, J. Shalf, M. Abraham, M. Bianco, B. L. Chamberlain, R. Cledat, H. C. Edwards, H. Finkel, K. Fuerlinger, F. Hannig, E. Jeannot, A. Kamil, J. Keasler, P. H. J. Kelly, V. Leung, H. Ltaief, N. Maruyama, C. J. Newburn, and M. Pericas. Trends in Data Locality Abstractions for HPC Systems. IEEE Transactions on Parallel and Distributed Systems, volume PP (99), pages 1-1, 2017. (»BibTeX, »Online)

2016

  • Roger Kowalewski and Karl Fürlinger. Nasty-MPI: Debugging Synchronization Errors in MPI-3 One-Sided Applications. In Proceedings of the 22nd International Conference on Parallel and Distributed Computing (Euro-Par 2016), pages 51-62. Grenoble, France, 2016. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger, Tobias Fuchs, and Roger Kowalewski. DASH: A C++ PGAS Library for Distributed Data Structures and Parallel Algorithms. In Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications (HPCC 2016), pages 983-990. Sydney, Australia, December 2016. (»BibTeX, »Preprint, »Online)
  • Tobias Fuchs and Karl Fürlinger. A Multi-Dimensional Distributed Array Abstraction for PGAS. In Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications (HPCC 2016), pages 1061-1068. Sydney, Australia, December 2016. (»BibTeX, »Preprint, »Online)
  • Xavier Aguilar, Karl Fürlinger, and Erwin Laure. Online MPI Trace Compression using Event Flow Graphs and Wavelets. In Proceedings of the 2016 International Conference on Computational Science, ICCS. San Diego, USA, June 2016. (»BibTeX, »Online)
  • Tobias Fuchs and Karl Fürlinger. Expressing and Exploiting Multidimensional Locality in DASH. In Hans-Joachim Bungartz, Philipp Neumann, and Wolfgang E. Nagel, editors, Software for Exascale Computing - SPPEXA 2013-2015, pages 341-359. Springer. Garching, Germany, 2016. (»BibTeX, »Preprint, »Online)
  • Denis Hünich, Andreas Knüpfer, Sebastian Oeste, Karl Fürlinger, and Tobias Fuchs. Tool Support for Developing DASH Applications. In Hans-Joachim Bungartz, Philipp Neumann, and Wolfgang E. Nagel, editors, Software for Exascale Computing - SPPEXA 2013-2015, pages 361-377. Springer. Garching, Germany, 2016. (»BibTeX, »Online)

2015

  • Karl Fürlinger. Exploiting Hierarchical Exascale Hardware Using a PGAS Approach. In Proceedings of the 3rd International Conference on Exascale Applications and Software, pages 48-52. University of Edinburgh. Edinburgh, Scotland, UK, 2015. (»BibTeX)
  • Xavier Aguilar, Karl Fürlinger, and Erwin Laure. Automatic On-Line Detection of MPI Application Structure with Event Flow Graphs. In Proceedings of the 21st International Euro-Par Conference on Parallel Processing (Euro-Par '15). Vienna, Austria, August 2015. (»BibTeX, »Online)
  • Xavier Aguilar, Karl Fürlinger, and Erwin Laure. Visual MPI Performance Analysis using Event Flow Graphs. In Proceedings of the 2015 International Conference on Computational Science, ICCS. Reykjavik, Iceland, June 2015. (»BibTeX, »Online)
  • J. Dampf, T. Pany, W. Bär, J. Winkel, C. Stöber, K. Fürlinger, P. Closas, and J. A. Garcia-Molina. More Than We Ever Dreamed Possible: Processor Technology for GNSS Software Receivers in the Year 2015. Inside GNSS, volume 10 (4), pages 62-72, July 2015. (»BibTeX)
  • T. Pany, J. Dampf, W. Bär, J. Winkel, C. Stöber, K. Fürlinger, P. Closas, and J. A. Garcia-Molina. Benchmarking CPUs and GPUs on Embedded Platforms for Software Receiver Usage. In Proceedings of the 28th International Technical Meeting of The Satellite Division of the Institute of Navigation (ION GNSS+ 2015), pages 3188-3197. Tampa, Florida, USA, September 2015. (»BibTeX, »Preprint)
  • Lei Zhou and Karl Fürlinger. DART-CUDA: A PGAS Runtime System for Multi-GPU Systems. In IEEE 14th International Symposium on Parallel and Distributed Computing (ISPDC). Limassol, Cyprus, June 2015. (»BibTeX, »Preprint, »Online)

2014

  • Karl Fürlinger, Colin Glass, Andreas Knüpfer, Jie Tao, Denis Hünich, Kamran Idrees, Matthias Maiterth, Yousri Mhedheb, and Huan Zhou. DASH: Data Structures and Algorithms with Support for Hierarchical Locality. In Euro-Par 2014 Workshops (Porto, Portugal), 2014. (»BibTeX, »Preprint, »Online)
  • Xavier Aguilar, Karl Fürlinger, and Erwin Laure. MPI Trace Compression using Event Flow Graphs. In Proceedings of the 20th International Euro-Par Conference on Parallel Processing (Euro-Par '14), 2014. (»BibTeX, »Online)
  • Huan Zhou, Yousri Mhedheb, Kamran Idrees, Colin Glass, Jose Gracia, Karl Fürlinger, and Jie Tao. DART-MPI: An MPI-based Implementation of a PGAS Runtime System. In The 8th International Conference on Partitioned Global Address Space Programming Models (PGAS), October 2014. (»BibTeX, »Preprint, »Online)
  • Jiaqi Zhao, Jie Tao, and Karl Fürlinger. A Framework for Comparative Performance Study on Virtualized Machines. International Journal of Ad Hoc and Ubiquitous Computing, volume 17 (2/3), pages 82-99, 2014. (»BibTeX, »Online)

2013

  • Bronis R. de Supinski, Bettina Krammer, Karl Fürlinger, Jesús Labarta, and Dimitrios S. Nikolopoulos. Topic 1: Support Tools and Environments (Introduction). In Felix Wolf, Bernd Mohr, and Dieter an Mey, editors, Euro-Par 2013 Parallel Processing: 19th International Conference, Aachen, Germany, August 26-30, 2013. Proceedings, pages 3-3. Springer, 2013. (»BibTeX, »Online)
  • Xavier Aguilar, Erwin Laure, and Karl Fürlinger. Online Performance Data Introspection with IPM. In Proceedings of the 15th IEEE International Conference on High Performance Computing and Communications, 2013. (»BibTeX, »Online)

2012

  • Simone Ferlin Oliveira, Karl Fürlinger, and Dieter Kranzlmüller. Trends in Computation, Communication and Storage and the Consequences for Data-intensive Science. In Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, HPCC-ICESS 2012, pages 572-579, 2012. (»BibTeX, »Online)
  • Jie Tao, Karl Fürlinger, Lizhe Wang, and Holger Marten. A Performance Study of Virtual Machines on Multicore Architectures. In Proceedings of the 20th Euromicro International Conference on Parallel, Distributed and Network-Based Computing (PDP 2012). Garching, Germany, February 2012. (»BibTeX, »Preprint, »Online)

2011

  • Orlando Rivera, Karl Fürlinger, and Dieter Kranzlmüller. Investigating the Scalability of OpenFOAM for the Solution of Transport Equations and Large Eddy Simulations. In Proceedings of the 2011 International Symposium on Advances of Distributed Computing and Networking (ADCN 2011), in conjunction with ICA3PP 2011. Melbourne, Australia, October 2011. (»BibTeX, »Online)
  • Karl Fürlinger, Christof Klausecker, and Dieter Kranzlmüller. The AppleTV-Cluster: Towards Energy Efficient Parallel Computing on Consumer Electronic Devices. Whitepaper v1.0, April 2011. (»BibTeX, »Preprint)
  • Nicholas J. Wright, Hongzhang Shan, Filip Blagojevic, Harvey Wasserman, Tony Drummond, John Shalf, Karl Fürlinger, Katherine Yelick, Stephane Ethier, Marcus Wagner, Nathan Wichman, Sarah Anderson, and Mike Aamodt. The NERSC-Cray Center of Excellence: Performance Optimization for the Multicore Era. In Cray User's Group Meeting 2011. Fairbanks, AK, USA, May 2011. (»BibTeX)
  • Orlando Rivera and Karl Fürlinger. Parallel Aspects of OpenFOAM with Large Eddy Simulations. In Proceedings of the 13th IEEE International Conference on High Performance Computing and Communications (HPCC-11). Banff, Canada, September 2011. (»BibTeX, »Online)
  • Karl Fürlinger, Christof Klausecker, and Dieter Kranzlmüller. Towards Energy Efficient Parallel Computing on Consumer Electronic Devices. In ICT-GLOW 2011. Toulouse, France, September 2011. (»BibTeX, »Online)
  • Karl Fürlinger, Nicholas J. Wright, and David Skinner. Comprehensive Performance Monitoring for GPU Cluster Systems. In Proceedings of the 12th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), in conjunction with IPDPS-11. Anchorage, Alaska, USA, May 2011. (»BibTeX, »Preprint, »Online)
  • Jie Tao, Karl Fürlinger, and Holger Marten. Performance Evaluation of OpenMP Applications on Virtualized Multicore Machines. In Proceedings of the 7th International Workshop on OpenMP (IWOMP 2011). Chicago, IL, USA, June 2011. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger. OpenMP Profiling with ompP. In David Padua, editor, Encyclopedia of Parallel Computing, 2011. (»BibTeX, »Online)

2010

  • Karl Fürlinger, Nicholas J. Wright, David Skinner, Christof Klausecker, and Dieter Kranzlmüller. Effective Holistic Performance Measurement at Petascale Using IPM. In Proceedings of CiHPC: Competence in High Performance Computing. Schwetzingen, Germany, May 2010. (»BibTeX, »Online)
  • Hongzhang Shan, Haoqiang Jin, Karl Fürlinger, Alice Koniges, and Nicholas J. Wright. Analyzing the Effect of Different Programming Models upon Performance and Memory Usage on Cray XT5 Platforms. In Cray User's Group Meeting 2010. Edinburgh, May 2010. (»BibTeX, »Preprint)
  • Karl Fürlinger. OpenMP Application Profiling - State of the Art and Directions for the Future. In Proceedings of the 2010 International Conference on Computational Science (ICCS 2010). Amsterdam, NL, May 2010. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger, Nicholas J. Wright, and David Skinner. Effective Performance Measurement at Petascale Using IPM. In Proceedings of The Sixteenth IEEE International Conference on Parallel and Distributed Systems (ICPADS 2010). Shanghai, China, December 2010. (»BibTeX, »Preprint, »Online)

2009

  • Karl Fürlinger and David Skinner. Capturing and Visualizing Event Flow Graphs of MPI Applications. In Workshop on Productivity and Performance (PROPER 2009) in conjunction with Euro-Par 2009, August 2009. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Shirley Moore. Recording the Control Flow of Parallel Applications to Determine Iterative and Phase-Based Behavior. Future Generation Computer Systems (FGCS), volume 26 (1), pages 162-166, 2009. (»BibTeX, »Online)
  • Karl Fürlinger and Shirley Moore. Capturing and Analyzing the Execution Control Flow of OpenMP Applications. International Journal of Parallel Programming (IJPP), volume 37 (3), pages 266-276, 2009. (»BibTeX, »Online)
  • Karl Fürlinger and David Skinner. Performance Profiling for OpenMP Tasks. In Proceedings of the 5th International Workshop on OpenMP (IWOMP 2009), pages 132-139. Dresden, Germany, June 2009. LNCS 5568. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger, Nicholas J. Wright, and David Skinner. Performance Analysis and Workload Characterization with IPM. In Proceedings of the 3rd International Workshop on Parallel Tools for High Performance Computing. Dresden, September 2009. (»BibTeX, »Online)

2008

  • Karl Fürlinger and Shirley Moore. OpenMP-centric Performance Analysis of Hybrid Applications. In Proceedings of the 2008 IEEE International Conference on Cluster Computing (CLUSTER 2008), pages 160-166. Tsukuba, Japan, September 2008. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger, Dan Terpstra, Haihang You, Phil Mucci, and Shirley Moore. Enabling Data Structure Oriented Performance Analysis with Hardware Performance Counter Support. In Workshop on Productivity and Performance (PROPER 2008) in conjunction with Euro-Par 2008, pages 263-272, August 2008. LNCS 5415. (»BibTeX, »Online)
  • Felix Wolf, Brian Wylie, Erika Abraham, Daniel Becker, Wolfgang Frings, Karl Fürlinger, Markus Geimer, Marc-Andre Hermanns, Bernd Mohr, Shirley Moore, and Zoltan Szebenyi. Usage of the SCALASCA Toolset for Scalable Performance Analysis of Large-Scale Parallel Applications. In Proceedings of the 2nd HLRS Parallel Tools Workshop. Stuttgart, Germany, July 2008. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Shirley Moore. Detection and Analysis of Iterative Behavior in Parallel Applications. In Proceedings of the 2008 International Conference on Computational Science (ICCS 2008), pages 261-267. Krakow, Poland, June 2008. LNCS 5103. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Shirley Moore. Visualizing the Program Execution Control Flow of OpenMP Applications. In Proceedings of the 4th International Workshop on OpenMP (IWOMP 2008), pages 181-190. Purdue, Indiana, USA, May 2008. LNCS 5004. (»BibTeX, »Preprint, »Online)

2007

  • Michael Gerndt and Karl Fürlinger. Specification and detection of performance problems with ASL. Concurrency and Computation: Practice and Experience, volume 19 (11), pages 1451-1464. John Wiley and Sons Ltd. 2007. (»BibTeX, »Online)
  • Karl Fürlinger and Jack Dongarra. On Using Incremental Profiling for the Performance Analysis of Shared Memory Parallel Applications. In Proceedings of the 13th International Euro-Par Conference on Parallel Processing (Euro-Par '07), pages 62-71, August 2007. LNCS 4641. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger, Michael Gerndt, and Jack Dongarra. Scalability Analysis of the SPEC OpenMP Benchmarks on Large-Scale Shared Memory Multiprocessors. In Proceedings of the 2007 International Conference on Computational Science (ICCS 2007), pages 815-822. Beijing, CN, May 2007. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Shirley Moore. Continuous Runtime Profiling of OpenMP Applications. In Proceedings of the 2007 Conference on Parallel Computing (PARCO 2007), pages 677-686, September 2007. (»BibTeX, »Preprint)
  • Michael Gerndt and Karl Fürlinger. Highly Scalable Performance Analysis Tools. In Petascale Computing: Algorithms and Applications. CRC Press, 2007. (»BibTeX, »Online)

2006

  • Karl Fürlinger and Michael Gerndt. Finding Inefficiencies in OpenMP Applications Automatically with Periscope. In Proceedings of the 2006 International Conference on Computational Science (ICCS 2006) (Vol. 2), pages 494-501. Reading, UK, May 2006. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Michael Gerndt. Automated Performance Analysis using ASL Performance Properties. In Proceedings of the 2006 Workshop on the State-of-the-Art in Scientific and Parallel Computing (PARA'06), pages 390-397. Umea, Sweden, June 2006. LNCS 4699. (»BibTeX, »Online)
  • Karl Fürlinger and Michael Gerndt. Analyzing Overheads and Scalability Characteristics of OpenMP Applications. In Proceedings of the Seventh International Meeting on High Performance Computing for Computational Science (VECPAR'06), pages 39-51. Rio de Janeiro, Brazil, July 2006. LNCS 4395. (»BibTeX, »Preprint, »Online)

2005

  • Karl Fürlinger and Michael Gerndt. Performance Analysis of Shared-Memory Parallel Applications using Performance Properties. In Proceedings of the 2005 International Conference on High Performance Computing and Communications (HPCC-05), pages 595-604. Sorrento, Italy, September 2005. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Michael Gerndt. Periscope: Performance Analysis on Large-Scale Systems. InSiDE -- Innovatives Supercomputing in Deutschland (Featured Article), volume 3 (2, Autumn), pages 26-29, 2005. (»BibTeX)
  • Karl Fürlinger and Michael Gerndt. ompP: A Profiling Tool for OpenMP. In Proceedings of the First and Second International Workshops on OpenMP (IWOMP 2005, IWOMP 2006), pages 15-23. Eugene, Oregon, USA, May 2005. LNCS 4315. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Michael Gerndt. A Lightweight Dynamic Application Monitor for SMP Clusters. In Siegfried Wagner, Werner Hanke, Arndt Bode, and Franz Durst, editors, High Performance Computing in Science and Engineering, Munich 2004: Transactions of the Second Joint HLRB and KONWIHR Status and Result Workshop, March 2--3, 2004, Technical University of Munich, and Leibniz-Rechenzentrum Munich, Germany, pages 27-36. Springer Berlin Heidelberg. Berlin, Heidelberg, 2005. (»BibTeX, »Online)
  • Michael Gerndt, Karl Fürlinger, and Edmond Kereku. Periscope: Advanced Techniques for Performance Analysis. In Proceedings of the 2005 International Conference on Parallel Computing (ParCo 2005), pages 15-26. Malaga, Spain, September 2005. Invited paper. (»BibTeX)

2004

  • Karl Fürlinger, Olaf Schenk, and Michael Hagemann. Task-Queue Based Hybrid Parallelism: A Case Study. In Proceedings of the 10th International Euro-Par Conference on Parallel Processing (Euro-Par '04), pages 624-632, 2004. (»BibTeX, »Preprint, »Online)
  • Karl Fürlinger and Michael Gerndt. Peridot: Towards Automated Runtime Detection of Performance Bottlenecks. In High Performance Computing in Science and Engineering, Garching 2004, pages 193-202. Springer, 2004. (»BibTeX)

2003

  • Michael Gerndt, Karl Fürlinger, and Andreas C. Schmidt. Towards Automatic Performance Analysis for Large Scale Systems. In International Workshop on Compilers for Parallel Computers (CPC-2003). Amsterdam, The Netherlands, January 2003. (»BibTeX)
  • Karl Fürlinger and Michael Gerndt. Distributed Application Monitoring for Clustered SMP Architectures. In Harald Kosch, László Böszörményi, and Hermann Hellwagner, editors, Proceedings of the 9th International Euro-Par Conference on Parallel Processing, pages 127-134. Springer. Klagenfurt, Austria, August 2003. (»BibTeX, »Online)
  • Karl Fürlinger and Michael Gerndt. Distributed Configurable Application Monitoring on SMP Clusters. In Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, editors, Proceedings of the 10th European PVM/MPI Users' Group Meeting, pages 429-437. Springer. Venice, Italy, September 2003. (»BibTeX, »Online)
  • Thomas Pany, Markus Irsigler, Bernd Eisfeller, and Karl Fürlinger. Performance Assessment of an Under-Sampling SWC Receiver for Simulated High-Bandwidth GPS/Galileo Signals and Real Signals. In Proceedings of the 16th International Technical Meeting of the Satellite Division of the Institute of Navigation (ION GPS/GNSS 2003). Portland, Oregon, September 2003. (»BibTeX)

1999

  • H. A. Mayer, K. Fürlinger, and M. Strapetz. Extraction of compact rule sets from evolutionary designed artificial neural networks. In Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on, volume 1, pages 420-424, 1999. (»BibTeX, »Online)