document.write('
Jump to year:
');
document.write('[2023]');
document.write(' ');
document.write('[2022]');
document.write(' ');
document.write('[2021]');
document.write(' ');
document.write('[2020]');
document.write(' ');
document.write('[2019]');
document.write(' ');
document.write('[2018]');
document.write(' ');
document.write('[2017]');
document.write('
');
document.write('[2016]');
document.write(' ');
document.write('[2015]');
document.write(' ');
document.write('[2014]');
document.write(' ');
document.write('[2013]');
document.write(' ');
document.write('[2012]');
document.write(' ');
document.write('[2011]');
document.write(' ');
document.write('[2010]');
document.write('
');
document.write('[2009]');
document.write(' ');
document.write('[2008]');
document.write(' ');
document.write('[2007]');
document.write(' ');
document.write('[2006]');
document.write(' ');
document.write('[2005]');
document.write(' ');
document.write('[2004]');
document.write(' ');
document.write('[2003]');
document.write('
');
document.write('[2002]');
document.write(' ');
document.write('[2001]');
document.write(' ');
document.write('[2000]');
document.write(' ');
document.write('[1999]');
document.write('');
document.write('
');
document.write('List of all BibTeX entries:
');
document.write('[BibTeX Entries]
');
document.write('
Christian Hellwig, Fabian Czappa, Martin Michel, Reinhold Bertrand, Felix Wolf: Satellite Collision Detection using Spatial Data Structures. In Proc. of the 37th IEEE International Parallel and Distributed Processing Symposium (IPDPS), St. Petersburg, Florida, USA, pages 1–11, May 2023.
PDF BibTeX
Jean-Baptiste Besnard, Ahmad Tarraf, Clément Barthélemy, Alberto Cascajo, Emmanuel Jeannot, Sameer S. Shende, Felix Wolf: Towards Smarter Schedulers: Molding Jobs into the Right Shape via Monitoring and Modeling. In Proc. of the 2nd International Workshop on Malleability Techniques Applications in High-Performance Computing (HPCMALL 2023), held in conjunction with the ISC High Performance Conference (ISC), Hamburg, Germany, May 2023, (accepted).
BibTeX
Fabian Czappa, Alexander Geiß, Felix Wolf: Simulating Structural Plasticity of the Brain more Scalable than Expected. Journal of Parallel and Distributed Computing, 171:24–27, January 2023.
URL DOI BibTeX
Marcus Ritter, Ahmad Tarraf, Alexander Geiß, Nour Daoud, Bernd Mohr, Felix Wolf: Conquering Noise With Hardware Counters on HPC Systems. In Proc. of the Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC22), pages 1–10, IEEE, 2022.
PDF DOI BibTeX
Hannah Nöttgen, Fabian Czappa, Felix Wolf: Accelerating Brain Simulations with the Fast Multipole Method. In Proc. of the 28th Euro-Par Conference 2022: Parallel Processing, Glasgow, UK of Lecture Notes in Computer Science, pages 387–402, Springer, August 2022.
PDF DOI BibTeX
Taylan Özden, Tim Beringer, Arya Mazaheri, Hamid Mohammadi Fard, Felix Wolf: ElastiSim: A Batch-System Simulator for Malleable Workloads. In Proc. of the 51st International Conference on Parallel Processing (ICPP), Bordeaux, France, pages 1–11, ACM, August 2022.
PDF DOI BibTeX
Angelina Horn, Hamid Mohammadi Fard, Felix Wolf: Multi-objective Hybrid Autoscaling of Microservices in Kubernetes Clusters. In Proc. of the 28th Euro-Par Conference: Parallel Processing, Glasgow, UK, volume 13440 of Lecture Notes in Computer Science, pages 233–250, Springer, August 2022.
PDF URL DOI BibTeX
Sushil Prasad, Sheikh Ghafoor, Martina Barnas, Felix Wolf, Erik Saule, Noemi Rodriguez, Rizos Sakellariou (eds.): Editorial of Special Issue: Keeping up with technology: Teaching parallel, distributed, and high-performance computing. Journal of Parallel and Distributed Computing, 160:36–38, February 2022.
DOI BibTeX
Fabian Czappa, Alexandru Calotoiu, Thomas Höhl, Heiko Mantel, Toni Nguyen, Felix Wolf: Design-Time Performance Modeling of Compositional Parallel Programs. Parallel Computing, 108:1–12, September 2021.
PDF URL DOI BibTeX
Jan-Patrick Lehr, Christian Bischof, Florian Dewald, Heiko Mantel, Mohammad Norouzi, Felix Wolf: Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization. In Proc. of the 50th International Conference on Parallel Processing (ICPP), Chicago, Illinois, USA, pages 1–10, ACM, August 2021.
DOI BibTeX
Dmitry A. Nikitenko, Felix Wolf, Bernd Mohr, Torsten Hoefler, Konstantin S. Stefanov, Vadim Vladimirovich Voevodin, Aleksandr Sergeevich Antonov, Alexandru Calotoiu: Influence of Noisy Environments on Behavior of HPC Applications. Lobachevskii Journal of Mathematics, 42(7):1560–1570, July 2021.
URL DOI BibTeX
Rahim Mammadli, Marija Selakovic, Felix Wolf, Michael Pradel: Learning to Make Compiler Optimizations More Effective. In Proc. of the 5th ACM SIGPLAN International Symposium on Machine Programming (MAPS ’21), pages 9–20, ACM, June 2021.
PDF DOI BibTeX
Marcus Ritter, Alexander Geiß, Johannes Wehrstein, Alexandru Calotoiu, Thorsten Reimann, Torsten Hoefler, Felix Wolf: Noise-Resilient Empirical Performance Modeling with Deep Neural Networks. In Proc. of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Portland, Oregon, USA, pages 23–34, IEEE, May 2021.
PDF DOI BibTeX
Felix Wolf, Wanling Gao (eds.), Benchmarking, Measuring, and Optimizing - Proc. of the 3rd BenchCouncil International Symposium (Bench 2020), volume 12614 of Lecture Notes in Computer Science, Springer, March 2021.
DOI BibTeX
Marcin Copik, Alexandru Calotoiu, Tobias Grosser, Nicolas Wicki, Felix Wolf, Torsten Hoefler: Extracting Clean Performance Models from Tainted Programs. In Proc. of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Seoul, South Korea, pages 403–417, ACM, February 2021.
URL DOI BibTeX
Alexandru Calotoiu, Marcin Copik, Torsten Hoefler, Marcus Ritter, Sergei Shudler, Felix Wolf: Software for Exascale Computing - SPPEXA 2016-2019, chapter ExtraPeak: Advanced Automatic Performance Modeling for HPC Applications. Springer, pages 453–482, 2020.
DOI BibTeX
Reiner Hähnle, Asmae Heydari Tabar, Arya Mazaheri, Mohammad Norouzi, Dominic Steinhöfel, Felix Wolf: Safer Parallelization. In Proc. of the 9th International Symposium On Leveraging Applications of Formal Methods, Verification and Validation: Engineering Principles. ISoLA 2020, Rhodes, Greece, volume 1477 of Lecture Notes in Computer Science, pages 117–137, Springer, 2020.
DOI BibTeX
Hamid Mohammadi Fard, Radu Prodan, Felix Wolf: Dynamic Multi-objective Scheduling of Microservices in the Cloud. In Proc. of 2020 IEEE/ACM 13th International Conference on Utility and Cloud Computing (UCC), Leicester, UK, pages 386–393, IEEE, December 2020.
PDF DOI BibTeX
Rahim Mammadli, Ali Jannesari, Felix Wolf: Static Neural Compiler Optimization via Deep Reinforcement Learning. In Proc. of the 6th Workshop on the LLVM Compiler Infrastructure in HPC, held in conjunction with the Supercomputing Conference (SC20), pages 1–11, IEEE, November 2020.
PDF DOI BibTeX
Alexandru Calotoiu, Markus Geisenhofer, Florian Kummer, Marcus Ritter, Jens Weber, Torsten Hoefler, Martin Oberlack, Felix Wolf: Empirical Modeling of Spatially Diverging Performance. In Proc. of the Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC20), pages 1–10, IEEE, November 2020.
PDF DOI BibTeX
Fabian Schrammel, Florian Renk, Arya Mazaheri, Felix Wolf: Efficient Ephemeris Models for Spacecraft Trajectory Simulations on GPUs. In Proc. of the 26th Euro-Par Conference, Warsaw, Poland, volume 12247 of Lecture Notes in Computer Science, pages 561–577, Springer, August 2020.
PDF DOI BibTeX
Nicolas Morew, Mohammad Norouzi, Ali Jannesari, Felix Wolf: Skipping Non-essential Instructions Makes Data-Dependence Profiling Faster. In Proc. of the 26th Euro-Par Conference, Warsaw, Poland, volume 12247 of Lecture Notes in Computer Science, pages 3–17, Springer, August 2020.
PDF DOI BibTeX
Marcus Ritter, Alexandru Calotoiu, Sebastian Rinke, Thorsten Reimann, Torsten Hoefler, Felix Wolf: Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling. In Proc. of the 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, LA, USA, pages 884–895, IEEE, May 2020.
PDF DOI BibTeX
Arya Mazaheri, Tim Beringer, Matthew Moskewicz, Felix Wolf, Ali Jannesari: Accelerating Winograd Convolutions using Symbolic Computation and Meta-programming. In Proc. of the 15th EuroSys Conference, Heraklion, Crete, Greece, pages 1–14, ACM, April 2020.
PDF DOI BibTeX
Jan-Patrick Lehr, Alexandru Calotoiu, Christian Bischof, Felix Wolf: Automatic Instrumentation Refinement for Empirical Performance Modeling. In Proc. of the Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC19), Denver, CO, USA, pages 40–47, November 2019.
PDF DOI BibTeX
Alexandru Calotoiu, Thomas Höhl, Heiko Mantel, Toni Nguyen, Felix Wolf: Designing Efficient Parallel Software via Compositional Performance Modeling. In Proc. of the Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC19), Denver, CO, USA, pages 17–24, November 2019.
PDF DOI BibTeX
Hamid Mohammadi Fard, Radu Prodan, Felix Wolf: A Container-driven Approach for Resource Provisioning in Edge-Fog Cloud. In Proc. of the 5th International Symposium on Algorithmic Aspects of Cloud Computing (ALGOCLOUD 2019), Munich, Germany, pages 59–76, Springer, September 2019.
PDF DOI BibTeX
Sergei Shudler, Yannick Berens, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf: Engineering Algorithms for Scalability through Continuous Validation of Performance Expectations. IEEE Transactions on Parallel and Distributed Systems, 30(8):1768–1785, August 2019.
PDF DOI BibTeX
Arya Mazaheri, Johannes Schulte, Matthew Moskewicz, Felix Wolf, Ali Jannesari: Enhancing the Programmability and Performance Portability of GPU Tensor Operations. In Proc. of the 25th Euro-Par Conference, Göttingen, Germany, volume 11725 of Lecture Notes in Computer Science, pages 213–226, Springer, August 2019, (best paper award).
PDF DOI BibTeX
Mohammad Norouzi, Qamar Ilias, Ali Jannesari, Felix Wolf: Accelerating Data-Dependence Profiling with Static Hints. In Proc. of the 25th Euro-Par Conference, Göttingen, Germany, volume 11725 of Lecture Notes in Computer Science, pages 17–28, Springer, August 2019.
PDF DOI BibTeX
Aamer Shah, Chihsong Kuo, Akihiro Nomura, Satoshi Matsuoka, Felix Wolf: How File-access Patterns Influence the Degree of I/O Interference between Cluster Applications. Supercomputing Frontiers and Innovations, 6(2):29–55, July 2019.
PDF DOI BibTeX
Mohammad Norouzi, Felix Wolf, Ali Jannesari: Automatic Construct Selection and Variable Classification in OpenMP. In Proc. of the International Conference on Supercomputing (ICS), Phoenix, AZ, USA, pages 330–341, ACM, June 2019.
PDF DOI BibTeX
Leah E. Lackner, Hamid Mohammadi Fard, Felix Wolf: Efficient Job Scheduling for Clusters with Shared Tiered Storage. In Proc. of the 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Larnaca, Cyprus, pages 321–330, IEEE, May 2019.
PDF DOI BibTeX
Rohit Atre, Zia Ul Huda, Felix Wolf, Ali Jannesari: Dissecting Sequential Programs for Parallelization - An Approach Based on Computational Units. Concurrency and Computation: Practice and Experience, 31(5):1–12, March 2019.
PDF DOI BibTeX
Rahim Mammadli, Felix Wolf, Ali Jannesari: The Art of Getting Deep Neural Networks in Shape. ACM Transactions on Architecture and Code Optimization (TACO), 15(4):62:1–62:21, January 2019.
PDF DOI BibTeX
Gabriele Mencagli, Dora B. Heras, Valeria Cardellini, Emiliano Casalicchio, Emmanuel Jeannot, Felix Wolf, Antonio Salis, Claudio Schifanella, Ravi Reddy Manumachu, Laura Ricci, Marco Beccuti, Laura Antonelli, José Daniel Garcia Sanchez, Stephen L. Scott (eds.), Euro-Par 2018: Parallel Processing Workshops, volume 11339 of Lecture Notes in Computer Science, Springer, January 2019.
URL BibTeX
Philip C. Roth, Kevin Huck, Ganesh Gopalakrishnan, Felix Wolf: Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges. In Proc. of the 5th Workshop on Visual Performance Analysis (VPA), held in conjunction with the Supercomputing Conference (SC18), Dallas, TX, USA, volume 11027 of Lecture Notes in Computer Science, pages 265–272, Springer, November 2018.
PDF DOI BibTeX
Sergei Shudler, Jadran Vrabec, Felix Wolf: Understanding the Scalability of Molecular Simulation using Empirical Performance Modeling. In Proc. of the 7th Workshop on Extreme Scale Programming Tools (ESPT), held in conjunction with the Supercomputing Conference (SC18), Dallas, TX, USA, volume 11027 of Lecture Notes in Computer Science, pages 125–143, Springer, November 2018.
PDF DOI BibTeX
Sebastian Rinke, Markus Butz-Ostendorf, Marc-André Hermanns, Mikaël Naveau, Felix Wolf: A Scalable Algorithm for Simulating the Structural Plasticity of the Brain. Journal of Parallel and Distributed Computing, 120:251–266, October 2018.
PDF DOI BibTeX
Michael Burger, Christian Bischof, Alexandru Calotoiu, Felix Wolf, Thomas Wunderer, Johannes Buchmann: Exploring the Performance Envelope of the LLL Algorithm. In CSE 2018 - 21st IEEE International Conference of Computational Science and Engineering, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Romania, pages 36–43, IEEE, October 2018.
PDF DOI BibTeX
Alexandru Calotoiu, Alexander Graf, Torsten Hoefler, Daniel Lorenz, Sebastian Rinke, Felix Wolf: Lightweight Requirements Engineering for Exascale Co-design. In Proc. of the 2018 IEEE International Conference on Cluster Computing (CLUSTER), Belfast, UK, pages 201–211, IEEE, September 2018.
PDF DOI BibTeX
Aamer Shah, Matthias S. Müller, Felix Wolf: Estimating the Impact of External Interference on Application Performance. In Proc. of the 24th Euro-Par Conference, Turin, Italy, volume 11014 of Lecture Notes in Computer Science, pages 46–58, Springer, August 2018.
PDF DOI BibTeX
Arya Mazaheri, Felix Wolf, Ali Jannesari: Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics. In Proc. of the 47th International Conference on Parallel Processing (ICPP), Eugene, OR, USA, pages 6:1–6:10. ACM, August 2018.
PDF DOI BibTeX
Suraj Prabhakaran, Marcel Neumann, Felix Wolf: Efficient Fault Tolerance through Dynamic Node Replacement. In Proc. of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Washington, DC, USA, pages 163–172, IEEE, May 2018.
PDF DOI BibTeX
Sebastian Rinke, Mikaël Naveau, Felix Wolf, Markus Butz-Ostendorf: The Rewiring Brain: A Computational Approach to Structural Plasticity in the Adult Brain, chapter Critical Periods Emerge from Homeostatic Structural Plasticity in a Full-Scale Model of the Developing Cortical Column. Academic Press, San Diego, pages 177–202, 2017.
BibTeX
Marc-André Hermanns, Markus Geimer, Bernd Mohr, Felix Wolf: Trace-based Detection of Lock Contention in MPI One-Sided Communication. In Tools for High Performance Computing 2016, Proc. of the 10th Parallel Tools Workshop, Stuttgart, Germany, October 2016, pages 97–114, Springer, 2017.
URL DOI BibTeX
Bernd Mohr, Felix Wolf: The Virtual Institute – High-Productivity Supercomputing Celebrates its 10th Anniversary. Innovatives Supercomputing in Deutschland (inSiDE), 15(2):40–41, 2017.
URL BibTeX
Patrick Reisert, Alexandru Calotoiu, Sergei Shudler, Felix Wolf: Following the Blind Seer – Creating Better Performance Models Using Less Information. In Proc. of the 23rd Euro-Par Conference, Santiago de Compostela, Spain, volume 10417 of Lecture Notes in Computer Science, pages 106–118, Springer, August 2017.
PDF DOI BibTeX
Kashif Ilyas, Alexandru Calotoiu, Felix Wolf: Off-Road Performance Modeling – How to Deal with Segmented Data. In Proc. of the 23rd Euro-Par Conference, Santiago de Compostela, Spain, volume 10417 of Lecture Notes in Computer Science, pages 36–48, Springer, August 2017.
PDF DOI BibTeX
Rohit Atre, Ali Jannesari, Felix Wolf: Meeting the challenges of parallelizing sequential programs. In Proc. of the 29th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), Washington, DC, USA, pages 363–365, ACM, July 2017.
PDF DOI BibTeX
Ali Jannesari, Zia Ul Huda, Rohit Atre, Zhen Li, Felix Wolf: Parallelizing Audio Analysis Applications - A Case Study. In Proc. of the 39th International Conference on Software Engineering, Software Engineering Education and Training Track (ICSE-SEET), pages 57–66, May 2017.
PDF DOI BibTeX
Ali Jannesari, Felix Wolf, Walter Tichy (eds.): Special Issue on Software Engineering for Parallel Systems. Journal of Systems and Software, 125:380–448, March 2017.
DOI BibTeX
Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Felix Wolf: Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications. In Proc. of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Austin, TX, USA, pages 131–143, ACM, February 2017.
PDF DOI BibTeX
Sebastian Rinke, Markus Butz-Ostendorf, Marc-André Hermanns, Mikaël Naveau, Felix Wolf: A Scalable Algorithm for Simulating the Structural Plasticity of the Brain. In Proc. of the 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Los Angeles, CA, USA, pages 1–8, October 2016.
PDF DOI BibTeX
Alexandru Calotoiu, David Beckingsale, Christopher W. Earl, Torsten Hoefler, Ian Karlin, Martin Schulz, Felix Wolf: Fast Multi-Parameter Performance Modeling. In Proc. of the 2016 IEEE International Conference on Cluster Computing (CLUSTER), Taipei, Taiwan, pages 172–181, IEEE, September 2016.
PDF DOI BibTeX
Felix Wolf, Christian Bischof, Alexandru Calotoiu, Torsten Hoefler, Christian Iwainsky, Grzegorz Kwasniewski, Bernd Mohr, Sergei Shudler, Alexandre Strube, Andreas Vogel, Gabriel Wittum: Software for Exascale Computing - SPPEXA 2013-2015, chapter Automatic Performance Modeling of HPC Applications. Springer, pages 445–465, September 2016.
DOI BibTeX
Andreas Vogel, Alexandru Calotoiu, Arne Nägel, Sebastian Reiter, Alexandre Strube, Gabriel Wittum, Felix Wolf: Software for Exascale Computing - SPPEXA 2013-2015, chapter Automated Performance Modeling of the UG4 Simulation Framework. Springer, pages 467–481, September 2016.
PDF DOI BibTeX
Zhen Li, Rohit Atre, Zia Ul Huda, Ali Jannesari, Felix Wolf: Unveiling Parallelization Opportunities in Sequential Programs. Journal of Systems and Software, 117:282–295, July 2016.
PDF DOI BibTeX
David Böhme, Markus Geimer, Lukas Arnold, Felix Voigtländer, Felix Wolf: Identifying the root causes of wait states in large-scale parallel applications. ACM Transactions on Parallel Computing, 3(2):Article No. 11, 24 pages, July 2016.
PDF DOI BibTeX
Zia Ul Huda, Rohit Atre, Ali Jannesari, Felix Wolf: Automatic Parallel Pattern Detection in the Algorithm Structure Design Space. In Proc. of the 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, USA, pages 43–52, IEEE, May 2016.
PDF DOI BibTeX
Ali Jannesari, Felix Wolf: Automatic Generation of Unit Tests for Correlated Variables in Parallel Programs. International Journal of Parallel Programming (IJPP), 44(3):644–662, March 2016.
PDF DOI BibTeX
Monika Harlacher, Alexandru Calotoiu, John Dennis, Felix Wolf: Analysing the Scalability of Climate Codes Using New Features of Scalasca. In Proc. of the John von Neumann Institute for Computing (NIC) Symposium 2016, Juelich, Germany, volume 48 of NIC Series, pages 343–352. Forschungszentrum Jülich, John von Neumann-Institut for Computing, February 2016.
BibTeX
Zhen Li, Rohit Atre, Zia Ul-Huda, Ali Jannesari, Felix Wolf: DiscoPoP: A Profiling Tool to Identify Parallelization Opportunities. In Tools for High Performance Computing 2014, Proc. of the 8th Parallel Tools Workshop, Stuttgart, Germany, October 2014, chapter 3, pages 37–54, Springer, 2015.
PDF DOI BibTeX
Laura von Rüden, Marc-André Hermanns, Michael Behrisch, Daniel Keim, Bernd Mohr, Felix Wolf: Separating the Wheat from the Chaff: Identifying Relevant and Similar Performance Data with Visual Analytics. In Proc. of the 2nd Workshop on Visual Performance Analysis (VPA), held in conjunction with the Supercomputing Conference (SC15), Austin, TX, USA, pages 4:1–4:8, ACM, 2015.
PDF DOI BibTeX
Zhen Li, Bo Zhao, Ali Jannesari, Felix Wolf: Beyond Data Parallelism: Identifying Parallel Tasks in Sequential Programs. In Proc. of 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Zhangjiajie, China, volume 9531 of Lecture Notes in Computer Science, pages 569–582, Springer, November 2015.
PDF DOI BibTeX
Zhen Li, Michael Beaumont, Ali Jannesari, Felix Wolf: Fast Data-Dependence Profiling by Skipping Repeatedly Executed Memory Operations. In Proc. of 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Zhangjiajie, China, volume 9531 of Lecture Notes in Computer Science, pages 583–596, Springer, November 2015.
PDF DOI BibTeX
Daniel Lorenz, Sergei Shudler, Felix Wolf: Preventing the explosion of exascale profile data with smart thread-level aggregation. In Proc. of the 4th Workshop on Extreme Scale Programming Tools (ESPT), held in conjunction with the Supercomputing Conference (SC15), Austin, TX, USA, pages 1–10, ACM, November 2015.
PDF DOI BibTeX
Arya Mazaheri, Ali Jannesari, Abdolreza Mirzaei, Felix Wolf: Characterizing Loop-Level Communication Patterns in Shared Memory Applications. In Proc. of the 44th International Conference on Parallel Processing (ICPP), Beijing, China, pages 759–768, September 2015.
PDF DOI BibTeX
Andreas Vogel, Alexandru Calotoiu, Alexandre Strube, Sebastian Reiter, Arne Nägel, Felix Wolf, Gabriel Wittum: 10,000 Performance Models per Minute - Scalability of the UG4 Simulation Framework. In Proc. of the 21st Euro-Par Conference, Vienna, Austria, volume 9233 of Lecture Notes in Computer Science, pages 519–531, Springer, August 2015.
PDF DOI BibTeX
Christian Iwainsky, Sergei Shudler, Alexandru Calotoiu, Alexandre Strube, Michael Knobloch, Christian Bischof, Felix Wolf: How Many Threads will be too Many? On the Scalability of OpenMP Implementations. In Proc. of the 21st Euro-Par Conference, Vienna, Austria, volume 9233 of Lecture Notes in Computer Science, pages 451–463, Springer, August 2015.
PDF DOI BibTeX
Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf: Exascaling Your Library: Will Your Implementation Meet Your Expectations?. In Proc. of the International Conference on Supercomputing (ICS), Newport Beach, CA, USA, pages 165–175, ACM, June 2015.
PDF DOI BibTeX
Suraj Prabhakaran, Marcel Neumann, Sebastian Rinke, Felix Wolf, Abhishek Gupta, Laxmikant V. Kalé: A Batch System with Efficient Scheduling for Malleable and Evolving Applications. In Proc. of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Hyderabad, India, pages 429–438, IEEE, May 2015.
PDF DOI BibTeX
Zhen Li, Ali Jannesari, Felix Wolf: An Efficient Data-Dependence Profiler for Sequential and Parallel Programs. In Proc. of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Hyderabad, India, pages 484–493, IEEE, May 2015.
PDF DOI BibTeX
Rohit Atre, Ali Jannesari, Felix Wolf: The Basic Building Blocks of Parallel Tasks. In Proc. of the International Workshop on Code Optimisation for Multi and Many Cores, San Francisco, CA, USA, pages 3:1–3:11, ACM, February 2015.
PDF DOI BibTeX
Bo Zhao, Zhen Li, Ali Jannesari, Felix Wolf, Weiguo Wu: Dependence-Based Code Transformation for Coarse-Grained Parallelism. In Proc. of the International Workshop on Code Optimisation for Multi and Many Cores, San Francisco, CA, USA, pages 1:1–1:10, ACM, February 2015.
PDF DOI BibTeX
Zia Ul Huda, Ali Jannesari, Felix Wolf: Using Template Matching to Infer Parallel Design Patterns. ACM Transactions on Architecture and Code Optimization, 11(4):64:1–64:21, January 2015.
PDF DOI BibTeX
Christian Lengauer, Luc Bougé, Felix Wolf (eds.): Special Issue: Euro-Par 2013. Concurrency and Computation: Practice and Experience, 26(14):2345–2346, 2014.
DOI BibTeX
Lucas Theisen, Aamer Shah, Felix Wolf: Down to Earth – How to Visualize Traffic on High-dimensional Torus Networks. In Proc. of VPA: First workshop on Visual Performance Analysis, held in conjunction with Supercomputer 2014, New Orleans, LA, pages 1–6, November 2014.
PDF DOI BibTeX
Suraj Prabhakaran, Mohsin Iqbal, Sebastian Rinke, Christian Windisch, Felix Wolf: A Batch System with Fair Scheduling for Evolving Applications. In Proc. of the 43rd International Conference on Parallel Processing (ICPP), Minneapolis, MN, USA, pages 1–10, September 2014.
PDF DOI BibTeX
Daniel Lorenz, Robert Dietrich, Ronny Tschüter, Felix Wolf: A comparison between OPARI2 and the OpenMP tools interface in the context of Score-P. In Proc. of the 10th International Workshop on OpenMP (IWOMP), Salvador, Brazil, September 2014, volume 8766 of LNCS, pages 161–172, Springer, September 2014.
PDF DOI BibTeX
Gouyong Mao, David Böhme, Marc-André Hermanns, Markus Geimer, Daniel Lorenz, Felix Wolf: Catching Idlers with Ease: A Lightweight Wait-State Profiler for MPI Programs. In EuroMPI '14: Proc. of the 21th European MPI Users' Group Meeting, Kyoto, Japan, pages 103–108, ACM, September 2014.
PDF DOI BibTeX
Chihsong Kuo, Aamer Shah, Akihiro Nomura, Satoshi Matsuoka, Felix Wolf: How File Access Patterns Influence Interference Among Cluster Applications. In Proc. of the IEEE International Conference on Cluster Computing (CLUSTER), Madrid, Spain, pages 1–8, IEEE, September 2014.
PDF DOI BibTeX
Felix Wolf, Christian Bischof, Torsten Hoefler, Bernd Mohr, Gabriel Wittum, Alexandru Calotoiu, Christian Iwainsky, Alexandre Strube, Andreas Vogel: Catwalk: A Quick Development Path for Performance Models. In Euro-Par 2014: Parallel Processing Workshops, volume 8805, 8806 of Lecture Notes in Computer Science, Springer, September 2014.
DOI BibTeX
Alexandru Calotoiu, Torsten Hoefler, Felix Wolf: Mass-producing Insightful Performance Models. In Workshop on Modeling & Simulation of Systems and Applications, University of Washington, Seattle, Washington, August 2014.
PDF URL BibTeX
Ali Jannesari, Nico Koprowski, Jochen Schimmel, Felix Wolf: Generating Classified Parallel Unit Tests. In Proc. of the 8th International Conference on Tests and Proofs (TAP), York, UK, volume 8570 of Lecture Notes in Computer Science, pages 117–133, Springer, July 2014.
PDF DOI BibTeX
Andreas Knüpfer, Robert Dietrich, Jens Doleschal, Markus Geimer, Marc-André Hermanns, Christian Rössel, Ronny Tschüter, Bert Wesarg, Felix Wolf: Generic Support for Remote Memory Access Operations in Score-P and OTF2. In Tools for High Performance Computing 2012, Proc. of the 6th Parallel Tools Workshop, Stuttgart, Germany, September 2012, pages 57–74, Springer, 2013.
DOI BibTeX
Bernd Mohr, Vladimir Voevodin, Judit Giménez, Erik Hagersten, Andreas Knüpfer, DmitryA. Nikitenko, Mats Nilsson, Harald Servat, Aamer Shah, Frank Winkler, Felix Wolf, Ilya Zhukov: The HOPSA Workflow and Tools. In Tools for High Performance Computing 2012, Proc. of the 6th Parallel Tools Workshop, Stuttgart, Germany, September 2012, pages 127–146, Springer, 2013.
PDF DOI BibTeX
Bernd Mohr, Felix Wolf, Alexandru Calotoiu, Torsten Hoefler: The Catwalk Project – A Quick Development Path for Performance Models. Innovatives Supercomputing in Deutschland (inSiDE), 11(2):68–71, 2013.
URL BibTeX
Daniel Fried, Zhen Li, Ali Jannesari, Felix Wolf: Predicting Parallelization of Sequential Programs Using Supervised Learning. In Proc. of the 12th IEEE International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA, pages 72–77, IEEE, December 2013.
PDF DOI BibTeX
Ali Jannesari, Nico Koprowski, Jochen Schimmel, Felix Wolf, Walter F. Tichy: Detecting Correlation Violations and Data Races by Inferring Non-deterministic Reads. In Proc. of the 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS), Seoul, Korea, pages 1–9, IEEE, December 2013.
PDF DOI BibTeX
Alexandru Calotoiu, Torsten Hoefler, Marius Poke, Felix Wolf: Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes. In Proc. of the ACM/IEEE Conference on Supercomputing (SC13), Denver, CO, USA, pages 1–12, ACM, November 2013.
PDF DOI BibTeX
Suraj Prabhakaran, Mohsin Iqbal, Sebastian Rinke, Felix Wolf: A Dynamic Resource Management System for Network-Attached Accelerator Clusters. In Proc. of the 42nd International Conference on Parallel Processing Workshops (ICPPW), Workshop on Scheduling and Resource Management for Parallel and Distributed Systems (SRMPDS), Lyon, France, pages 773–782, October 2013.
PDF DOI BibTeX
Sebastian Rinke, Suraj Prabhakaran, Felix Wolf: Efficient Offloading of Parallel Kernels Using MPI_Comm_spawn. In Proc. of the 42nd International Conference on Parallel Processing Workshops (ICPPW), Workshop on Heterogeneous and Unconventional Cluster Architectures and Applications (HUCAA), Lyon, France, pages 877–884, October 2013.
PDF DOI BibTeX
Zhen Li, Ali Jannesari, Felix Wolf: Discovery of Potential Parallelism in Sequential Programs. In Proc. of the 42nd International Conference on Parallel Processing Workshops (ICPPW), Workshop on Parallel Software Tools and Tool Infrastructures (PSTI), Lyon, France, pages 1004–1013, October 2013.
PDF DOI BibTeX
Marc-André Hermanns, Manfred Miklosch, David Böhme, Felix Wolf: Understanding the formation of wait states in applications with one-sided communication. In EuroMPI '13: Proc. of the 20th European MPI Users' Group Meeting, Madrid, Spain, September 15–18, 2013, pages 73–78, New York, NY, USA, ACM, September 2013.
PDF DOI BibTeX
Aamer Shah, Felix Wolf, Sergey Zhumatiy, Vladimir Voevodin: Capturing inter-application interference on clusters. In Proc. of the IEEE International Conference on Cluster Computing (CLUSTER), Indianapolis, IN, USA, pages 1–5, IEEE, September 2013.
PDF DOI BibTeX
Felix Wolf, Bernd Mohr, Dieter an Mey (eds.), Euro-Par 2013: Parallel Processing, volume 8097 of Lecture Notes in Computer Science, Advanced Research in Computing and Software Science, Springer, August 2013.
DOI BibTeX
Wolfgang Frings, Dong H. Ahn, Matthew LeGendre, Todd Gamblin, Bronis R. de Supinski, Felix Wolf: Massively Parallel Loading. In Proc. of the 27th International Conference on Supercomputing (ICS), Eugene, OR, USA, pages 389–398, ACM, June 2013.
PDF DOI BibTeX
Daniel Becker, Markus Geimer, Rolf Rabenseifner, Felix Wolf: Extending the scope of the controlled logical clock. Cluster Computing, 16(1):171–189, March 2013.
PDF DOI BibTeX
Andreas Galonska, Paul Gibbon, Frederic Imbeaux, Yann Frauel, Bernard Guillerminet, Gabriele Manduchi, Felix Wolf: Parallel Universal Access Layer: A Scalable I/O Library for Integrated Tokamak Modelling. Computer Physics Communications, 184(3):638–-646, March 2013.
DOI BibTeX
Marc-André Hermanns, Sriram Krishnamoorthy, Felix Wolf: A scalable infrastructure for the performance analysis of passive target synchronization. Parallel Computing, 39(3):132–145, March 2013.
PDF DOI BibTeX
Markus Geimer, Pavel Saviankou, Alexandre Strube, Zoltán Szebenyi, Felix Wolf, Brian J. N. Wylie: Further improving the scalability of the Scalasca toolset. In Proc. of PARA 2010: State of the Art in Scientific and Parallel Computing, Part II: Minisymposium Scalable tools for High Performance Computing, Reykjavik, Iceland, June 6–9 2010, volume 7134 of Lecture Notes in Computer Science, pages 463–474, Springer, 2012.
PDF DOI BibTeX
Dieter an Mey, Scott Biersdorff, Christian Bischof, Kai Diethelm, Dominic Eschweiler, Michael Gerndt, Andreas Knüpfer, Daniel Lorenz, Allen D. Malony, Wolfgang E. Nagel, Yury Oleynik, Christian Rössel, Pavel Saviankou, Dirk Schmidl, Sameer S. Shende, Michael Wagner, Bert Wesarg, Felix Wolf: Score-P: A Unified Performance Measurement System for Petascale Applications. In Proc. of the CiHPC: Competence in High Performance Computing, HPC Status Konferenz der Gauß-Allianz e.V., Schwetzingen, Germany, June 2010, pages 85–97. Gauß-Allianz, Springer, 2012.
PDF DOI BibTeX
Dominic Eschweiler, Michael Wagner, Markus Geimer, Andreas Knüpfer, Wolfgang E. Nagel, Felix Wolf: Open Trace Format 2 - The Next Generation of Scalable Trace Formats and Support Libraries. In Proc. of the Intl. Conference on Parallel Computing (ParCo), Ghent, Belgium, August 30 – September 2 2011, volume 22 of Advances in Parallel Computing, pages 481–490, IOS Press, 2012.
PDF DOI BibTeX
Andreas Knüpfer, Christian Rössel, Dieter an Mey, Scott Biersdorff, Kai Diethelm, Dominic Eschweiler, Markus Geimer, Michael Gerndt, Daniel Lorenz, Allen D. Malony, Wolfgang E. Nagel, Yury Oleynik, Peter Philippen, Pavel Saviankou, Dirk Schmidl, Sameer S. Shende, Ronny Tschüter, Michael Wagner, Bert Wesarg, Felix Wolf: Score-P – A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir. In Tools for High Performance Computing 2011, Proc. of the 5th Parallel Tools Workshop, Dresden, Germany, September 2011, pages 79–91, Springer, 2012.
PDF DOI BibTeX
Christian Rössel, Bernd Mohr, Michael Gerndt, Felix Wolf: Performance Dynamics of Massively Parallel Codes. Innovatives Supercomputing in Deutschland (inSiDE), 10(2):72–73, 2012.
PDF URL BibTeX
David Böhme, Marc-André Hermanns, Felix Wolf: Scalasca. In Entwicklung und Evolution von Forschungssoftware, Rolduc, November 2011, volume 14 of Aachener Informatik-Berichte, Software Engineering, pages 43–48, Shaker, 2012.
BibTeX
Christian Rössel, Bernd Mohr, Felix Wolf: Score-P. In Entwicklung und Evolution von Forschungssoftware, Rolduc, Niederlande, November 2011, volume 14 of Aachener Informatik-Berichte, Software Engineering, pages 23–30, Shaker, 2012.
BibTeX
Daniel Lorenz, Peter Philippen, Dirk Schmidl, Felix Wolf: Profiling of OpenMP tasks with Score-P. In Proc. of the 41st International Conference on Parallel Processing Workshops (ICPPW), Workshop on Parallel Software Tools and Tool Infrastructures (PSTI), pages 444–453, September 2012.
PDF DOI BibTeX
Sebastian Rinke, Daniel Becker, Thomas Lippert, Suraj Prabhakaran, Lidia Westphal, Felix Wolf: A Dynamic Accelerator-Cluster Architecture. In Proc. of the 41st International Conference on Parallel Processing Workshops (ICPPW), Workshop on Scheduling and Resource Management for Parallel and Distributed Systems (SRMPDS), Pittsburgh, PA, USA, pages 357–366, September 2012.
PDF DOI BibTeX
Marc-André Hermanns, Markus Geimer, Bernd Mohr, Felix Wolf: Scalable detection of MPI-2 remote memory access inefficiency patterns. Intl. Journal of High Performance Computing Applications (IJHPCA), 26(3):227–236, August 2012.
PDF DOI BibTeX
Alexandru Calotoiu, Christian Siebert, Felix Wolf: Pattern-Independent Detection of Manual Collectives in MPI Programs. In Proc. of the 18th Euro-Par Conference, Rhodes Island, Greece, volume 7484 of Lecture Notes in Computer Science, pages 28–39, Springer, August 2012.
PDF DOI BibTeX
Dirk Schmidl, Peter Philippen, Daniel Lorenz, Christian Rössel, Markus Geimer, Dieter an Mey, Bernd Mohr, Felix Wolf: Performance Analysis Techniques for Task-Based OpenMP Applications. In Proc. of the 8th International Workshop on OpenMP (IWOMP), Rome, Italy, volume 7312 of Lecture Notes in Computer Science, pages 196–209, Berlin / Heidelberg, Springer, June 2012.
PDF DOI BibTeX
David Böhme, Bronis R. de Supinski, Markus Geimer, Martin Schulz, Felix Wolf: Scalable Critical-Path Based Performance Analysis. In Proc. of the 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Shanghai, China, pages 1330–1340, IEEE, May 2012.
PDF DOI BibTeX
David Böhme, Markus Geimer, Felix Wolf: Characterizing Load and Communication Imbalance in Large-Scale Parallel Applications. In Proc. of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), Shanghai, China, pages 2538–2541, IEEE, May 2012.
PDF DOI BibTeX
Daniel Harlacher, Harald Klimach, Sabine Roller, Christian Siebert, Felix Wolf: Dynamic Load Balancing for Unstructured Meshes on Space-Filling Curves. In Proc. of the IEEE 26th International Parallel and Distributed Processing Symposium (IPDPS) Workshops \& PhD Forum, Shanghai, China, pages 1655–1663, IEEE, May 2012, Workshop on Large-Scale Parallel Processing.
PDF DOI BibTeX
Felix Wolf: Understanding the Formation of Wait States in Parallel Programs. Innovatives Supercomputing in Deutschland (inSiDE), 1(9):94–95, 2011.
URL BibTeX
Felix Wolf: Scalasca. In Encyclopedia of Parallel Computing, pages 1775–1785, Springer, October 2011.
URL BibTeX
Jan Mußler, Daniel Lorenz, Felix Wolf: Reducing the overhead of direct application instrumentation using prior static analysis. In Proc. of the 17th Euro-Par Conference, Bordeaux, France, volume 6852 of Lecture Notes in Computer Science, pages 65–76, Springer, September 2011.
PDF DOI BibTeX
Markus Geimer, Marc-André Hermanns, Christian Siebert, Felix Wolf, Brian J. N. Wylie: Scaling Performance Tool MPI Communicator Management. In Proc. of the 18th European MPI Users' Group Meeting (EuroMPI), Santorini, Greece, volume 6960 of Lecture Notes in Computer Science, pages 178–187, Springer, September 2011.
PDF DOI BibTeX
Christian Siebert, Felix Wolf: Parallel Sorting with Minimal Data. In Proc. of the 18th European MPI Users' Group Meeting (EuroMPI), Santorini, Greece, volume 6960 of Lecture Notes in Computer Science, pages 170–177, Springer, September 2011.
PDF DOI BibTeX
Marc-André Hermanns, Sriram Krishnamoorthy, Felix Wolf: A Scalable Replay-based Infrastructure for the Performance Analysis of One-sided Communication. In Proc. of the 1st Intl. Workshop on High-performance Infrastructure for Scalable Tools (WHIST), held in conjunction with the International Conference on Supercomputing (ICS), Tucson, AZ, USA, June 2011.
PDF BibTeX
Zoltán Szebenyi, Todd Gamblin, Martin Schulz, Bronis R. de Supinski, Felix Wolf, Brian J. N. Wylie: Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs. In Proc. of the 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Anchorage, AK, USA, pages 640–648, IEEE, May 2011.
PDF DOI BibTeX
Zoltán Szebenyi, Felix Wolf, Brian J. N. Wylie: Performance Analysis of Long-running Applications. In Proc. of the 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS) PhD Forum, Anchorage, AK, USA, pages 2100–2103, IEEE, May 2011.
PDF DOI BibTeX
Dominic Eschweiler, Daniel Becker, Felix Wolf: Patterns of inefficient performance behavior in GPU applications. In Proc. of the 19th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Ayia Napa, Cyprus, pages 262–266, IEEE, February 2011.
PDF DOI BibTeX
Markus Geimer, Felix Wolf, Brian J. N. Wylie, Daniel Becker, David Böhme, Wolfgang Frings, Marc-André Hermanns, Bernd Mohr, Zoltán Szebenyi: Recent Developments in the Scalasca Toolset. In Tools for High Performance Computing 2009, Proc. of the 3rd Parallel Tools Workshop, Dresden, Germany, September 2009, chapter 4, pages 39–51, Springer, 2010.
PDF DOI BibTeX
Bernd Mohr, Brian J. N. Wylie, Felix Wolf: Performance measurement and analysis tools for extremely scalable systems. Concurrency and Computation: Practice and Experience, 22(16):2212–2229, 2010, (ISC 2008 Award).
PDF DOI BibTeX
Brian J. N. Wylie, Markus Geimer, Bernd Mohr, David Böhme, Zoltán Szebenyi, Felix Wolf: Large-scale performance analysis of Sweep3D with the Scalasca toolset. Parallel Processing Letters, 20(4):397–414, December 2010.
PDF DOI BibTeX
David Böhme, Markus Geimer, Felix Wolf, Lukas Arnold: Identifying the root causes of wait states in large-scale parallel applications. In Proc. of the 39th International Conference on Parallel Processing (ICPP), San Diego, CA, USA, pages 90–100, IEEE, September 2010, Best Paper Award.
PDF DOI BibTeX
Daniel Becker, Markus Geimer, Rolf Rabenseifner, Felix Wolf: Synchronizing the Timestamps of Concurrent Events in Traces of Hybrid MPI/OpenMP Applications. In Proc. of IEEE International Conference on Cluster Computing (CLUSTER), Heraklion, Greece, pages 38–47, IEEE, September 2010.
PDF DOI BibTeX
Daniel Lorenz, Bernd Mohr, Christian Rössel, Dirk Schmidl, Felix Wolf: How to reconcile event-based performance analysis with tasking in OpenMP. In Proc. of 6th Int. Workshop of OpenMP (IWOMP), Tsukuba, Japan, volume 6132 of Lecture Notes in Computer Science, pages 109–121, Springer, June 2010.
PDF DOI BibTeX
Mohammad Shahbaz Memon, Morris Riedel, Ahmed Shiraz Memon, Felix Wolf, Achim Streit, Thomas Lippert, M. Plociennik, M. Owsiak, D. Tskhakaya, Ch. Konz: Lessons Learned From Jointly Using HTC- and HPC-driven e-Science Infrastructures in Fusion Science. In Proc. of the International Conference on Information and Emerging Technologies (ICIET), Karachi, Pakistan, IEEE, June 2010.
PDF DOI BibTeX
Morris Riedel, Bernd Schuller, Michael Rambadt, Mohammad Shahbaz Memon, Ahmed Shiraz Memon, Achim Streit, Thomas Lippert, Stefan J. Zasada, Steven Manos, Peter V. Coveney, Felix Wolf, Dieter Kranzlmüller: Exploring the Potential of Using Multiple E-science Infrastructures with Emerging Open Standards-Based E-health Research Tools. In Proc. of the 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Melbourne, Victoria, Australia, pages 341–348, IEEE, May 2010.
PDF DOI BibTeX
Markus Geimer, Felix Wolf, Brian J. N. Wylie, Erika Ábrahám, Daniel Becker, Bernd Mohr: The Scalasca performance toolset architecture. Concurrency and Computation: Practice and Experience, 22(6):702–719, April 2010.
PDF DOI BibTeX
Brian J. N. Wylie, David Böhme, Bernd Mohr, Zoltán Szebenyi, Felix Wolf: Performance analysis of Sweep3D on Blue Gene/P with the Scalasca toolset. In Proc. 24th International Parallel and Distributed Processing Symposium and Workshops (IPDPS), Atlanta, GA, USA, IEEE, April 2010.
PDF DOI BibTeX
Morris Riedel, Mohammad Shahbaz Memon, Ahmed Shiraz Memon, Achim Streit, Felix Wolf, Thomas Lippert, Moreno Marzolla, Dieter Kranzlmüller, Aleksandr Konstantinov, Oxana Smirnova, Johannes Watzl, Luigi Zangrando: Improvements of Common Open Grid Standards to Increase High Throughput and High Performance Computing Effectiveness on Large-scale Grid and e-Science Infrastructures. In Proc. 24th International Parallel and Distributed Processing Symposium and Workshops (IPDPS), 7th High-Performance Grid Computing Workshop (HPGC), Atlanta, USA, IEEE, April 2010.
PDF DOI BibTeX
David Böhme, Marc-André Hermanns, Markus Geimer, Felix Wolf: Performance Simulation of Non-blocking Communication in Message-Passing Applications. In Proc. of the 2nd Workshop on Productivity and Performance (PROPER) in conjunction with Euro-Par 2009, Delft, The Netherlands, volume 6043 of Lecture Notes in Computer Science, pages 208–217, Springer, March 2010.
PDF DOI BibTeX
Felix Wolf, David Böhme, Markus Geimer, Marc-André Hermanns, Bernd Mohr, Zoltán Szebenyi, Brian J. N. Wylie: Performance Tuning in the Petascale Era. In Proc. of the John von Neumann Institute for Computing (NIC) Symposium 2010, Juelich, Germany, volume 3 of IAS Series, pages 339–346. Forschungszentrum Jülich, John von Neumann-Institut for Computing, February 2010.
PDF BibTeX
Morris Riedel, Achim Streit, Daniel Mallmann, Felix Wolf, Thomas Lippert: Experiences and Requirements for Interoperability Between HTC and HPC-driven e-Science Infrastructure. In Future Application and Middleware Technology on e-Science, pages 113–123, Springer US, January 2010.
PDF DOI BibTeX
Morris Riedel, Wolfgang Frings, Thomas Eickermann, Sonja Habbinga, Paul Gibbon, Daniel Mallmann, Achim Streit, Felix Wolf, Thomas Lippert: Collaborative Interactivity in Parallel HPC Applications. In Proc. of the Instrumenting the Grid (InGrid) 2008 Workshop, Lacco Ameno, Island of Ischia, Italy, pages 249–262, Springer, January 2010.
PDF DOI BibTeX
Zoltán Szebenyi, Brian J. N. Wylie, Felix Wolf: Scalasca Parallel Performance Analyses of PEPC. In Proc. of the 1st Workshop on Productivity and Performance (PROPER) in conjunction with Euro-Par 2008, Las Palmas de Gran Canaria, Spain, volume 5415 of Lecture Notes in Computer Science, pages 305–314, Springer, 2009.
PDF DOI BibTeX
Felix Wolf: Performance Tools for Petascale Systems. Innovatives Supercomputing in Deutschland (inSiDE), 7(2):38–39, 2009.
URL BibTeX
Morris Riedel, Achim Streit, Thomas Lippert, Felix Wolf, Dieter Kranzlmüller: Concepts and Design of an Interoperability Reference Model for Scientific- and Grid Computing Infrastructures. In Proc. of the Applied Computing Conference, in Mathematical Methods and Applied Computing, Volume II, pages 691–698, WSEAS Press, 2009.
PDF BibTeX
Daniel Becker, Rolf Rabenseifner, Felix Wolf, John Linford: Scalable timestamp synchronization for event traces of message-passing applications. Parallel Computing, 35(12):595–607, December 2009.
PDF DOI BibTeX
Morris Riedel, Felix Wolf, Dieter Kranzlmüller, Achim Streit, Thomas Lippert: Research Advances by Using Interoperable e-Science Infrastructures - The Infrastructure Interoperability Reference Model Applied in e-Science. Cluster Computing, 12(4):357–372, December 2009.
PDF DOI BibTeX
Zoltán Szebenyi, Felix Wolf, Brian J. N. Wylie: Space-Efficient Time-Series Call-Path Profiling of Parallel Applications. In Proc. of the ACM/IEEE Conference on Supercomputing (SC09), Portland, OR, USA, ACM, November 2009.
PDF DOI BibTeX
Wolfgang Frings, Felix Wolf, Ventsislav Petkov: Scalable Massively Parallel I/O to Task-Local Files. In Proc. of the ACM/IEEE Conference on Supercomputing (SC09), Portland, OR, USA, ACM, November 2009.
PDF DOI BibTeX
Marc-André Hermanns, Markus Geimer, Bernd Mohr, Felix Wolf: Scalable Detection of MPI-2 Remote Memory Access Inefficiency Patterns. In Proc. of the 16th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Espoo, Finland, volume 5759 of Lecture Notes in Computer Science, pages 31–41, Springer, September 2009.
PDF DOI BibTeX
Markus Geimer, Felix Wolf, Brian J. N. Wylie, Bernd Mohr: A scalable tool architecture for diagnosing wait states in massively parallel applications. Parallel Computing, 35(7):375–388, July 2009.
PDF DOI BibTeX
Mohammad Shahbaz Memon, Ahmed Shiraz Memon, Morris Riedel, Achim Streit, Felix Wolf: Enabling Grid Interoperability by Extending HPC-driven Job Management with an Open Standard Information Model. In Proc. of the 8th IEEE/ACIS International Conference on Computer and Information Science (ICIS), Shanghai, China, pages 506–511, IEEE, June 2009.
DOI BibTeX
Markus Geimer, Sameer S. Shende, Allen D. Malony, Felix Wolf: A Generic and Configurable Source-Code Instrumentation Component. In Proc. of the International Conference on Computational Science (ICCS), Baton Rouge, LA, USA, volume 5545 of Lecture Notes in Computer Science, pages 696–705, Springer, May 2009.
PDF DOI BibTeX
Daniel Becker, Rolf Rabenseifner, Felix Wolf, John Linford: Replay-based synchronization of timestamps in event traces of massively parallel applications. Scalable Computing: Practice and Experience, 10(1):49–60, March 2009.
PDF URL BibTeX
Morris Riedel, E. Laure, Th. Soddemann, L. Field, J. P. Navarro, J. Casey, M. Litmaath, J. Ph. Baud, B. Koblitz, C. Catlett, D. Skow, C. Zheng, P.-M. Papadopoulos, M. Katz, N. Sharma, O. Smirnova, B. Kónya, P. Arzberger, F. Würthwein, A. S. Rana, T. Martin, M. Wan, V. Welch, T. Rimovsky, S. Newhouse, A. Vanni, Y. Tanaka, Y. Tanimura, T. Ikegami, D. Abramson, C. Enticott, G. Jenkins, R. Pordes, S. Timm, G. Moont, M. Aggarwal, D. Colling, O. van der Aa, A. Sim, V. Natarajan, A. Shoshani, J. Gu, G. Galang, R. Zappi, L. Magnoni, V. Ciaschini, M. Pace, Valerio Venturi, Moreno Marzolla, Paolo Andreetto, B. Cowles, S. Wang, Y. Saeki, H. Sato, S. Matsuoka, P. Uthayopas, S. Sriprayoonsakul, O. Koeroo, M. Viljoen, L. Pearlman, S. Pickles, D. Wallom, G. Moloney, J. Lauret, J. Marsteller, P. Sheldon, S. Pathak, S. De Witt, J. Mencák, J. Jensen, M. Hodges, D. Ross, S. Phatanapherom, G. Netzer, A. R. Gregersen, M. Jones, S. Chen, P. Kacsuk, Achim Streit, Daniel Mallmann, Felix Wolf, Thomas Lippert, Th. Delaitre, E. Huedo, N. Geddes: Interoperation of World-Wide Production e-Science Infrastructures. Concurrency and Computation: Practice and Experience, 21(8):961–990, March 2009.
PDF DOI BibTeX
Marc-André Hermanns, Markus Geimer, Felix Wolf, Brian J. N. Wylie: Verifying Causality Between Distant Performance Phenomena in Large-Scale MPI Applications. In Proc. of the 17th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), Weimar, Germany, pages 78–84, IEEE, February 2009.
PDF DOI BibTeX
Brian J. N. Wylie, Markus Geimer, Felix Wolf: Performance measurement and analysis of large-scale parallel applications on leadership computing systems. Scientific Programming, 16(2-3):167–181, 2008.
PDF URL DOI BibTeX
Felix Wolf, Brian J. N. Wylie, Erika Ábrahám, Daniel Becker, Wolfgang Frings, Karl Fürlinger, Markus Geimer, Marc-André Hermanns, Bernd Mohr, Shirley Moore, Matthias Pfeifer, Zoltán Szebenyi: Usage of the SCALASCA Toolset for Scalable Performance Analysis of Large-Scale Parallel Applications. In Tools for High Performance Computing, Proc. of the 2nd Parallel Tools Workshop, Stuttgart, Germany, July 2008, pages 157–167, Springer, 2008.
PDF DOI BibTeX
Morris Riedel, Achim Streit, Felix Wolf, Thomas Lippert, Dieter Kranzlmüller: Classification of Different Approaches for e-Science Applications in Next Generation Computing Infrastructures. In Proc. of the 4th IEEE Conference on e-Science (e-Science), Indianapolis, USA, pages 198–205, December 2008.
PDF DOI BibTeX
Daniel Becker, Rolf Rabenseifner, Felix Wolf: Implications of non-constant clock drifts for the timestamps of concurrent events. In Proc. of the IEEE International Conference on Cluster Computing (CLUSTER), Tsukuba, Japan, pages 59–68, IEEE, September 2008.
PDF DOI BibTeX
Morris Riedel, Wolfgang Frings, Sonja Habbinga, Thomas Eickermann, Daniel Mallmann, Achim Streit, Felix Wolf, Thomas Lippert: Extending the Collaborative Online Visualization and Steering Framework for Computational Grids with Attribute-based Authorization. In Proc. of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, pages 104–111, IEEE, September 2008.
PDF DOI BibTeX
Daniel Becker, John Linford, Rolf Rabenseifner, Felix Wolf: Replay-based synchronization of timestamps in event traces of massively parallel applications. In Proc. of the International Conference on Parallel Processing Workshops (ICPPW), 1st International Workshop on Simulation and Modelling in Emergent Computational Systems (SMECS), Portland, OR, USA, pages 212–219, IEEE, September 2008.
PDF DOI BibTeX
Daniel Becker, Morris Riedel, Achim Streit, Felix Wolf: Grid-Based Workflow Management for Automatic Performance Analysis of Massively Parallel Applications. In Proc. of the 3rd CoreGRID Workshop on Grid Middleware, Barcelona, Spain of CoreGRID Series, pages 103–118, Springer, June 2008.
PDF DOI BibTeX
Zoltán Szebenyi, Brian J. N. Wylie, Felix Wolf: SCALASCA Parallel Performance Analyses of SPEC MPI2007 Applications. In Proc. of the 1st SPEC International Performance Evaluation Workshop (SIPEW), Darmstadt, Germany, volume 5119 of Lecture Notes in Computer Science, pages 99–123, Springer, June 2008.
PDF DOI BibTeX
Oscar Hernandez, Fengguang Song, Barbara Chapman, Jack Dongarra, Bernd Mohr, Shirley Moore, Felix Wolf: Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications. In Proc. of the 2nd International Workshop on OpenMP (IWOMP 2006), Reims, France, volume 4315 of Lecture Notes in Computer Science, pages 267–278, Springer, June 2008.
PDF DOI BibTeX
Morris Riedel, Ahmed Shiraz Memon, Mohammad Shahbaz Memon, Daniel Mallmann, Achim Streit, Felix Wolf, Thomas Lippert, Valerio Venturi, Paolo Andreetto, Moreno Marzolla, Andrea Ferraro, Antonia Ghiselli, Fredrik Hedman, Zeeshan Ali Shah, Jean Salzemann, Ana Da Costa, Vincent Breton, Vinod Kasam, Martin Hofmann-Apitius, David Snelling, Sven van den Berghe, Vivian Li, Steve Brewer, Alistair Dunlop, Nishadi De Silva: Improving e-Science with Interoperability of the e-Infrastructures EGEE and DEISA. In Proc. of the 31st International Convention MIPRO, Conference on Grid and Visualization Systems (GVS), Opatija, Croatia, pages 225–231, Croatian Society for Information and Communication Technology, Electronics and Microelectronics, May 2008.
PDF BibTeX
Marc-André Hermanns, Markus Geimer, Felix Wolf, Brian J. N. Wylie: Verifying Causal Connections between Distant Performance Phenomena in Large-Scale Message-Passing Applications. Technical Report FZJ-JSC-IB-2008-05, Forschungszentrum Jülich, April 2008.
PDF BibTeX
Daniel Becker, Wolfgang Frings, Felix Wolf: Performance Evaluation and Optimization of Parallel Grid Computing Applications. In Proc. of the 16th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Toulouse, France, pages 193–199, IEEE, February 2008.
PDF DOI BibTeX
Felix Wolf, Daniel Becker, Markus Geimer, Brian J. N. Wylie: Scalable Performance Analysis Methods for the Next Generation of Supercomputers. In Proc. of the John von Neumann Institute for Computing (NIC) Symposium, Jülich, Germany, volume 39 of NIC-Series, pages 315–322, February 2008.
PDF BibTeX
Markus Geimer, Felix Wolf, Andreas Knüpfer, Bernd Mohr, Brian J. N. Wylie: A Parallel Trace-Data Interface for Scalable Performance Analysis. In Proc. of the 8th International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), Umeå, Sweden, June 2006, volume 4699 of Lecture Notes in Computer Science, pages 398–408, Springer, 2007.
PDF DOI BibTeX
Brian J. N. Wylie, Felix Wolf, Bernd Mohr, Markus Geimer: Integrated Runtime Measurement Summarisation and Selective Event Tracing for Scalable Parallel Execution Performance Diagnosis. In Proc. of the 8th International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), Umeå, Sweden, June 2006, volume 4699 of Lecture Notes in Computer Science, pages 460–469, Springer, 2007.
PDF DOI BibTeX
Christian Bischof, Felix Wolf: Produktivität versus Performanz in der Simulation. RWTH Themen, 2:38–39, 2007.
BibTeX
M. Behbahani, Marek Behr, Christian Bischof, Felix Wolf: Kranken Herzen helfen. RWTH Themen, 1:44–46, 2007.
BibTeX
Daniel Becker, Wolfgang Frings, Felix Wolf: Performance Evaluation and Optimization of Metacomputing Applications. In Proc. of the 3rd Workshop on Communication in Cluster- and Grid-Systems (KiCC, Kommunikation in Clusterrechnern und Clusterverbundsystemen), Aachen, Germany, pages 32–39. RWTH Aachen University, December 2007.
PDF URL BibTeX
Morris Riedel, T. Eickermann, S. Habbinga, Wolfgang Frings, Paul Gibbon, Daniel Mallmann, Achim Streit, Thomas Lippert, Felix Wolf, W. Schiffmann, A. Ernst, R. Spurzem, W. E. Nagel: Computational Steering and Online Visualization of Scientific Applications on Large-Scale HPC Systems within e-Science Infrastructures. In Proc. of 3rd IEEE International Conference on e-Science and Grid Computing, Bangalore, India, pages 483–490, IEEE, December 2007.
URL DOI BibTeX
Markus Geimer, Björn Kuhlmann, Farzona Pulatova, Felix Wolf, Brian J. N. Wylie: Scalable Collation and Presentation of Call-Path Profile Data with CUBE. In Proc. of the Conference on Parallel Computing (ParCo), Aachen/Jülich, Germany, pages 645–652, September 2007, Minisymposium Scalability and Usability of HPC Programming Tools.
PDF BibTeX
Daniel Becker, Rolf Rabenseifner, Felix Wolf: Timestamp Synchronization for Event Traces of Large-Scale Message-Passing Applications. In Proc. of the 14th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Paris, France, volume 4757 of Lecture Notes in Computer Science, pages 315–325, Springer, September 2007.
PDF DOI BibTeX
Morris Riedel, Thomas Eickermann, Wolfgang Frings, Sonja Dominiczak, Daniel Mallmann, Thomas Düssel, Achim Streit, Paul Gibbon, Felix Wolf, Wolfram Schiffmann, Thomas Lippert: Design and Evaluation of a Collaborative Online Visualization and Steering Framework Implementation for Computational Grids. In Proc. of the 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), Austin, Texas, USA, pages 169–177, September 2007.
PDF DOI BibTeX
Morris Riedel, Wolfgang Frings, Sonja Dominiczak, Thomas Eickermann, Thomas Düssel, Paul Gibbon, Daniel Mallmann, Felix Wolf, Wolfram Schiffmann: Requirements and Design of a Collaborative Online Visualization and Steering Framework for Grid and e-Science Infrastructures. In Proc. of the German e-Science Conference, Baden-Baden, Germany, Max Planck Digital Library - ID 316630.0, May 2007.
PDF BibTeX
Allen D. Malony, Sameer S. Shende, Alan Morris, Felix Wolf: Compensation of Measurement Overhead in Parallel Performance Profiling. International Journal of High Performance Computing Applications, 21(2):174–194, May 2007.
PDF DOI BibTeX
Daniel Becker, Felix Wolf, Wolfgang Frings, Markus Geimer, Brian J. N. Wylie, Bernd Mohr: Automatic Trace-Based Performance Analysis of Metacomputing Applications. In Proc. of the International Parallel and Distributed Processing Symposium (IPDPS), Long Beach, CA, USA, IEEE, March 2007.
PDF DOI BibTeX
Felix Wolf, Bernd Mohr, Jack Dongarra, Shirley Moore: Automatic analysis of inefficiency patterns in parallel applications. Concurrency and Computation: Practice and Experience, 19(11):1481–1496, February 2007.
PDF DOI BibTeX
Markus Geimer, Felix Wolf, Brian J. N. Wylie, Bernd Mohr: Scalable Parallel Trace-Based Performance Analysis. Innovatives Supercomputing in Deutschland (inSiDE), 4(2):16–19, 2006.
PDF URL BibTeX
Markus Geimer, Felix Wolf, Brian J. N. Wylie, Bernd Mohr: Scalable Parallel Trace-Based Performance Analysis. In Proc. of the 13th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Bonn, Germany, volume 4192 of Lecture Notes in Computer Science, pages 303–312, Springer, September 2006.
PDF DOI BibTeX
Andrej Kühnal, Marc-André Hermanns, Bernd Mohr, Felix Wolf: Specification of Inefficiency Patterns for MPI-2 One-sided Communication. In Proc. of the 12th Euro-Par Conference, Dresden, Germany, volume 4128 of Lecture Notes in Computer Science, pages 47–62, Springer, August 2006.
PDF DOI BibTeX
Gaby Aguilera, Patricia J. Teller, Michaela Taufer, Felix Wolf: A Systematic Multi-step Methodology for Performance Analysis of Communication Traces of Distributed Applications based on Hierarchical Clustering. In Proc. of the 5th International Workshop on Performance Modeling, Evaluation, and Organization of Parallel and Distributed Systems (PMEO-PDS, in conjunction with IPDPS 2006), Rhodes Island, Greece, IEEE, April 2006.
PDF DOI BibTeX
Felix Wolf, Felix Freitag, Bernd Mohr, Shirley Moore, Brian J. N. Wylie: Large Event Traces in Parallel Performance Analysis. In Proc. of the 8th Workshop on Parallel Systems and Algorithms (PASA), Frankfurt, Germany, volume P-81 of Lecture Notes in Informatics, pages 264–273, Gesellschaft für Informatik, March 2006.
PDF BibTeX
Felix Wolf, Allen D. Malony, Sameer S. Shende, Alan Morris: Trace-Based Parallel Performance Overhead Compensation. In Proc. of the International Conference on High Performance Computing and Communications (HPCC), Sorrento, Italy, volume 3726 of Lecture Notes in Computer Science, pages 617–628, Springer, September 2005.
PDF DOI BibTeX
Sameer S. Shende, Allen D. Malony, Alan Morris, Felix Wolf: Performance Profiling Overhead Compensation for MPI Programs. In Proc. of the 12th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Sorrento, Italy, volume 3666 of Lecture Notes in Computer Science, pages 359–367, Springer, September 2005.
PDF DOI BibTeX
Shirley Moore, Felix Wolf, Jack Dongarra, Sameer S. Shende, Allen D. Malony, Bernd Mohr: A Scalable Approach to MPI Application Performance Analysis. In Proc. of the 12th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Sorrento, Italy, volume 3666 of Lecture Notes in Computer Science, pages 309–316, Springer, September 2005.
PDF DOI BibTeX
Brian J. N. Wylie, Bernd Mohr, Felix Wolf: Holistic Hardware Counter Performance Analysis of Parallel Programs. In Proc. of the Conference on Parallel Computing (ParCo), Malaga, Spain, pages 187–194, September 2005.
PDF BibTeX
Bernd Mohr, Andrej Kühnal, Marc-André Hermanns, Felix Wolf: Performance Analysis of One-sided Communication Mechanisms. In Proc. of the Conference on Parallel Computing (ParCo), Malaga, Spain, September 2005, Minisymposium Performance Analysis.
PDF BibTeX
Marc-André Hermanns, Bernd Mohr, Felix Wolf: Event-based Measurement and Analysis of One-sided Communication. In Proc. of the 11th Euro-Par Conference, Lisboa, Portugal, volume 3648 of Lecture Notes in Computer Science, pages 156–165, Springer, August 2005.
PDF DOI BibTeX
Nikhil Bhatia, Fengguang Song, Felix Wolf, Bernd Mohr, Jack Dongarra, Shirley Moore: Automatic Experimental Analysis of Communication Patterns in Virtual Topologies. In Proc. of the International Conference on Parallel Processing (ICPP), Oslo, Norway, pages 465–472, IEEE Society, June 2005.
PDF DOI BibTeX
P. Worley, J. Candy, L. Carrington, K. Huck, T. Kaiser, G. Mahinthakumar, Allen D. Malony, Shirley Moore, D. Reed, P. Roth, H. Shan, Sameer S. Shende, A. Snavely, S. Sreepathi, Felix Wolf, Y. Zhang: Performance Analysis of GYRO: A Tool Evaluation. In Proc. of the 2005 SciDAC Conference, San Francisco, CA, USA, June 2005.
PDF BibTeX
Nikhil Bhatia, Shirley Moore, Felix Wolf, Jack Dongarra, Bernd Mohr: A Pattern-Based Approach to Automated Application Performance Analysis. In Workshop on Patterns in High Performance Computing (patHPC 2005), Urbana-Champaign, IL, USA, May 2005.
PDF BibTeX
Shirley Moore, Felix Wolf, Jack Dongarra, Bernd Mohr: Improving Time to Solution with Automated Performance Analysis. In 2nd Workshop on Productivity and Performance in High-End Computing (P-PHEC), San Francisco, CA, USA, February 2005.
PDF BibTeX
Felix Wolf, Bernd Mohr, Jack Dongarra, Shirley Moore: Efficient Pattern Search in Large Traces through Successive Refinement. In Proc. of the 10th Euro-Par Conference, Pisa, Italy, volume 3149 of Lecture Notes in Computer Science, pages 47–54, Springer, August 2004.
PDF DOI BibTeX
Fengguang Song, Felix Wolf, Nikhil Bhatia, Jack Dongarra, Shirley Moore: An Algebra for Cross-Experiment Performance Analysis. In Proc. of the International Conference on Parallel Processing (ICPP), Montreal, Canada, pages 63–72, IEEE Society, August 2004.
PDF DOI BibTeX
Philip Mucci, Jack Dongarra, Rick Kufrin, Shirley Moore, Fengguang Song, Felix Wolf: Automating the Large-Scale Collection and Analysis of Performance Data on Linux Clusters. In 5th LCI International Conference on Linux Clusters: The HPC Revolution, Austin, TX, USA, May 2004.
PDF URL BibTeX
Felix Wolf, Bernd Mohr: Automatic performance analysis of hybrid MPI/OpenMP applications. Journal of Systems Architecture, 49(10-11):421–439, November 2003.
PDF DOI BibTeX
Felix Wolf, Bernd Mohr: Hardware-Counter Based Automatic Performance Analysis of Parallel Programs. In Proc. of the Conference on Parallel Computing (ParCo), Dresden, Germany, volume 13 of Advances in Parallel Computing, pages 753–760, Elsevier, September 2003, Minisymposium Performance Analysis.
PDF DOI BibTeX
Felix Wolf, Bernd Mohr: KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Applications. In Proc. of the 9th Euro-Par Conference, Klagenfurt, Austria, volume 2790 of Lecture Notes in Computer Science, pages 1301–1304, Springer, August 2003, Demonstrations of Parallel and Distributed Computing.
PDF DOI BibTeX
Felix Wolf: Automatic Performance Analysis on Parallel Computers with SMP Nodes. PhD thesis, RWTH Aachen, Forschungszentrum Jülich, February 2003, NIC Series Volume 17, ISBN 3-00-010003-2.
URL BibTeX
Felix Wolf, Bernd Mohr: Automatic Performance Analysis of Hybrid MPI/OpenMP Applications. In Proc. of 11th Euromicro Workshop on Parallel Distributed and Network-Based Processing (PDP), Genua, Italy, pages 13–22, IEEE, February 2003.
PDF DOI BibTeX
Luiz A. DeRose, Felix Wolf: CATCH – A Call-Graph Based Automatic Tool for Capture of Hardware Performance Metrics for MPI and OpenMP Applications. In Proc. of the 8th Euro-Par Conference, Paderborn, Germany, volume 2400 of Lecture Notes in Computer Science, pages 167–176, Springer, August 2002.
PDF DOI BibTeX
Bernd Mohr, Allen D. Malony, Sameer S. Shende, Felix Wolf: Design and Prototype of a Performance Tool Interface for OpenMP. The Journal of Supercomputing, 23(1):105–128, August 2002.
PDF DOI BibTeX
Bernd Mohr, Allen D. Malony, Sameer S. Shende, Felix Wolf: Design and Prototype of a Performance Tool Interface for OpenMP. In 2nd Annual Los Alamos Computer Science Institute Symposium (LACSI), Santa Fe, NM, USA, October 2001.
PDF BibTeX
Felix Wolf, Bernd Mohr: Specifying Performance Properties of Parallel Applications Using Compound Events. Parallel and Distributed Computing Practices, 4(3):301–317, September 2001.
PDF URL BibTeX
Bernd Mohr, Allen D. Malony, Sameer S. Shende, Felix Wolf: Towards a Performance Tool Interface for OpenMP: An Approach based on Directive Rewriting. In 3rd European Workshop on OpenMP (EWOMP), Barcelona, Spain, September 2001.
PDF BibTeX
Thomas Fahringer, Michael Gerndt, Bernd Mohr, G. Riley, J. L. Träff, Felix Wolf: Knowledge Specification for Automatic Performance Analysis FZJ-ZAM-IB-2001-08, ESPRIT IV Working Group APART, Forschungszentrum Jülich, August 2001, Revised version.
PDF BibTeX
Felix Wolf, Bernd Mohr: Automatic Performance Analysis of MPI Applications Based on Event Traces. In Proc. of the 6th Euro-Par Conference, Munich, Germany, volume 1900 of Lecture Notes in Computer Science, pages 123–132, Springer, August 2000.
PDF DOI BibTeX
Felix Wolf, Bernd Mohr: EARL - A Programmable and Extensible Toolkit for Analyzing Event Traces of Message Passing Programs. In Proc. of the 7th International Conference on High Performance Computing and Networking Europe (HPCN), Amsterdam, The Netherlands, volume 1593 of Lecture Notes in Computer Science, pages 503–512, Springer, April 1999.
PDF DOI BibTeX
Michael Gerndt, Bernd Mohr, Felix Wolf, Mario Pantano: Performance Analysis on Cray T3E. In Proc. of the 7th Euromicro Workshop on Parallel and Distributed Processing (PDP), Funchal, Madeira, Portugal, pages 241–248, IEEE, February 1999.
PDF URL BibTeX