Muhammad Haseeb

Senior Software Engineer @ NVIDIA
2788 San Tomas Expressway
Santa Clara, California, USA

Email: mhsb123@gmail.com

Professional Summary

Experienced software engineer developing cutting-edge software (RAPIDS) for GPU-accelerated Data Science, AI, and scientific computing. Skilled in modern C++, CUDA C++, Python, Cython, CMake, Programming Models, Supercomputing, Data structures and Algorithms, Performance Engineering and Software Engineering.

Work Experience

2024 - Now | Senior Software Engineer - RAPIDS | NVIDIA | Santa Clara, CA
Developing GPU-accelerated software for data science and AI in NVIDIA RAPIDS using modern C++, CUDA C++, CCCL, Cython, and Python.

2023 - 24     | HPC Infrastructure & Performance Postdoc | NERSC, Lawrence Berkeley Nat'l Lab | Berkeley, CA
Developed GPU-accelerated scientific software using new technologies in Programming Models (MPI, CUDA, SYCL, Kokkos, OpenMP-offload, AMReX) and C++ (stdexec, parSTL). Modeled GPU-GPU communications in HPC applications over Perlmutter supercomputer interconnects.

2020 & 21   | HPC Application Performance Intern | NERSC, Lawrence Berkeley Nat'l Lab | Berkeley, CA
(2021): Developed and evaluated DPC++ (SYCL) and Python-based CPU-GPU software for ADEPT sequence alignment kernels. (2020): Contributed to the development of dynamic instrumentation for MPI, Python and C++, as well as CI/CD for Timemory.

2018 - 23     | Graduate Research Assistant | Florida International University | Miami, FL
Developed parallel algorithms, data structures, and GPU kernels to scalably accelerate computational proteomics algorithms by > 40x on modern supercomputers.

2017 - 18     | Graduate Research Assistant | Western Michigan University | Kalamazoo, MI
Developed a sparse data structure to optimize the space complexity of the fragment-ion database index by 2x.

2015 - 17     | Senior Software Engineer | Mentor Graphics Corporation (now Siemens EDA) | Lahore, Pakistan
Contributed to the development of system partitioning, low-level drivers, remote life-cycle management, inter-processor communication, and tracing features for the Mentor Embedded Nucleus RTOS and the Mentor Embedded Multicore Framework (MEMF).

Education

2023 | Ph.D. Computer Science (Advisor: Fahad Saeed) | Florida International University | Miami, FL

2015 | B.Sc. Electrical Engineering | University of Engineering and Technology Lahore | Lahore, Pakistan

Software Development

C++ • CUDA C++ • Python (+ common packages) • RAPIDS • CCCL • Nsight tools • CMake • Cython • NCCL • NVSHMEM • MPI • OpenMP • Kokkos • PyBind11 • AMReX • Git • DPC++/SYCL • Intel tools • Embedded C • Torch • Bash

Interests

C++ Evolution • Accelerator Architecture • CPU-GPU Architectures • Performance Engineering • Algorithms & Data Structures • Software Engineering • Experimental Technologies • Accelerator Programming Models

Actual Interests

Tolkien's Legendarium (Middle Earth Lore) • National Park Exploration • Maps and Geography • Video Games • Movies and TV • Sports - Pickleball, Ping Pong, Basketball (Go GSWarriors)


Select Publications

2023 | Muhammad Haseeb, and Fahad Saeed. "GPU-Acceleration of the Distributed-Memory Database Peptide Search of Mass Spectrometry Data.", Nature Scientific Reports, 13, 18713 (2023). https://doi.org/10.1038/s41598-023-43033-w

2023 | Nan Ding, Muhammad Haseeb, Taylor Groves, and Samuel Williams. "Evaluating Performance of One-sided MPI communication on CPUs and GPUs.", 2023 International Workshop on Performance, Portability and Productivity in HPC (P3HPC - SC '23), IEEE, 2023.

2023 | LeAnn Lindsey, Muhammad Haseeb, Hari Sundar, and Muaaz Awan. "TANGO: A GPU-Optimized Traceback Approach for Sequence Alignment Algorithms.", 13th International Workshop on Irregular Applications: Architectures and Algorithms (IA^3 - SC '23), IEEE, 2023.

2021 | Muhammad Haseeb, and Fahad Saeed. "High performance computing framework for tera-scale database search of mass spectrometry data." Nature Computational Science, Volume no. 1, Issue no. 8 (2021): pp. 550-561.

See more at Google Scholar.

Patents

2022 | Muhammad Haseeb, and Fahad Saeed. "Systems and Methods for Peptide Identification." U.S. Patent 11,309,061, issued April 19, 2022.

2020 | Fahad Saeed, and Muhammad Haseeb. "Methods and systems for compressing data." U.S. Patent 10,810,180, issued October 20, 2020.

Awards & Honors

2021 & 22 | Invention Award Bonuses, Office of Research and Economic Development, FIU

2021          | Best Graduate Student Research Award, School of Computing and Information Sciences, FIU

2019 & 22 | Various Travel Grants, School of Computing and Information Sciences, FIU

2018 - 23  | Tuition Waiver for PhD studies, School of Computing and Information Sciences, FIU

2016 - 17  | Role Model Software Engineer, Mentor Graphics Corporation (now Siemens EDA)