default search action

combined dblp search
author search
venue search
publication search

ask others

Dhiraj D. Kalamkar

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/ipps/GeorganasKVKNPB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ipps/GeorganasKVKNPB24
Evangelos Georganas, Dhiraj D. Kalamkar, Kirill Voronin, Abhisek Kundu, Antonio Noack, Hans Pabst, Alexander Breuer, Alexander Heinecke:
Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures. IPDPS 2024: 950-963
2023
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-12576
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-12576
Evangelos Georganas, Dhiraj D. Kalamkar, Kirill Voronin, Antonio Noack, Hans Pabst, Alexander Breuer, Alexander Heinecke:
Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures. CoRR abs/2304.12576 (2023)
2022
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/fams/GeorganasKAAAAB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fams/GeorganasKAAAAB22
Evangelos Georganas, Dhiraj D. Kalamkar, Sasikanth Avancha, Menachem Adelman, Deepti Aggarwal, Cristina Anderson, Alexander Breuer, Jeremy Bruestle, Narendra Chaudhary, Abhisek Kundu, Denise Kutnick, Frank Laub, Md. Vasimuddin, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Brian Retford, Barukh Ziv, Alexander Heinecke:
Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning and HPC Workloads. Frontiers Appl. Math. Stat. 8: 826269 (2022)
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/ipps/ChaudharyMKHGZA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ipps/ChaudharyMKHGZA22
Narendra Chaudhary, Sanchit Misra, Dhiraj D. Kalamkar, Alexander Heinecke, Evangelos Georganas, Barukh Ziv, Menachem Adelman, Bharat Kaul:
Accelerating Deep Learning based Identification of Chromatin Accessibility from noisy ATAC-seq Data. IPDPS Workshops 2022: 176-185
2021
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/GeorganasKAAABB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/GeorganasKAAABB21
Evangelos Georganas, Dhiraj D. Kalamkar, Sasikanth Avancha, Menachem Adelman, Cristina Anderson, Alexander Breuer, Jeremy Bruestle, Narendra Chaudhary, Abhisek Kundu, Denise Kutnick, Frank Laub, Md. Vasimuddin, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Barukh Ziv, Alexander Heinecke:
Tensor processing primitives: a programming abstraction for efficiency and portability in deep learning workloads. SC 2021: 14
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/MdMMMGHKAA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/MdMMMGHKAA21
Md. Vasimuddin, Sanchit Misra, Guixiang Ma, Ramanarayan Mohanty, Evangelos Georganas, Alexander Heinecke, Dhiraj D. Kalamkar, Nesreen K. Ahmed, Sasikanth Avancha:
DistGNN: scalable distributed training for large-scale graph neural networks. SC 2021: 76
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-05755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-05755
Evangelos Georganas, Dhiraj D. Kalamkar, Sasikanth Avancha, Menachem Adelman, Cristina Anderson, Alexander Breuer, Narendra Chaudhary, Abhisek Kundu, Md. Vasimuddin, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Barukh Ziv, Alexander Heinecke:
Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning Workloads. CoRR abs/2104.05755 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-06700
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06700
Md. Vasimuddin, Sanchit Misra, Guixiang Ma, Ramanarayan Mohanty, Evangelos Georganas, Alexander Heinecke, Dhiraj D. Kalamkar, Nesreen K. Ahmed, Sasikanth Avancha:
DistGNN: Scalable Distributed Training for Large-Scale Graph Neural Networks. CoRR abs/2104.06700 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-08002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-08002
Narendra Chaudhary, Sanchit Misra, Dhiraj D. Kalamkar, Alexander Heinecke, Evangelos Georganas, Barukh Ziv, Menachem Adelman, Bharat Kaul:
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning. CoRR abs/2104.08002 (2021)
2020
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/ipps/Georganas0KAVAH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ipps/Georganas0KAVAH20
Evangelos Georganas, Kunal Banerjee, Dhiraj D. Kalamkar, Sasikanth Avancha, Anand Venkat, Michael J. Anderson, Greg Henry, Hans Pabst, Alexander Heinecke:
Harnessing Deep Learning via a Single Building Block. IPDPS 2020: 222-233
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/KalamkarGSCSH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/KalamkarGSCSH20
Dhiraj D. Kalamkar, Evangelos Georganas, Sudarshan Srinivasan, Jianping Chen, Mikhail Shiryaev, Alexander Heinecke:
Optimizing deep learning recommender systems training on CPU cluster architectures. SC 2020: 43
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-04680
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04680
Dhiraj D. Kalamkar, Evangelos Georganas, Sudarshan Srinivasan, Jianping Chen, Mikhail Shiryaev, Alexander Heinecke:
Optimizing Deep Learning Recommender Systems' Training On CPU Cluster Architectures. CoRR abs/2005.04680 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/superfri/0001GKZSAH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/superfri/0001GKZSAH19
Kunal Banerjee, Evangelos Georganas, Dhiraj D. Kalamkar, Barukh Ziv, Eden Segal, Cristina Anderson, Alexander Heinecke:
Optimizing Deep Learning RNN Topologies on Intel Architecture. Supercomput. Front. Innov. 6(3): 64-85 (2019)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/cluster/KalamkarBS0GSXH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cluster/KalamkarBS0GSXH19
Dhiraj D. Kalamkar, Kunal Banerjee, Sudarshan Srinivasan, Srinivas Sridharan, Evangelos Georganas, Mikhail E. Smorkalov, Cong Xu, Alexander Heinecke:
Training Google Neural Machine Translation on an Intel CPU Cluster. CLUSTER 2019: 1-10
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-12322
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-12322
Dhiraj D. Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey:
A Study of BFLOAT16 for Deep Learning Training. CoRR abs/1905.12322 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1906-06440
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-06440
Evangelos Georganas, Kunal Banerjee, Dhiraj D. Kalamkar, Sasikanth Avancha, Anand Venkat, Michael J. Anderson, Greg Henry, Hans Pabst, Alexander Heinecke:
High-Performance Deep Learning via a Single Building Block. CoRR abs/1906.06440 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-07729
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-07729
Abhisek Kundu, Sudarshan Srinivasan, Eric C. Qin, Dhiraj D. Kalamkar, Naveen K. Mellempudi, Dipankar Das, Kunal Banerjee, Bharat Kaul, Pradeep Dubey:
K-TanH: Hardware Efficient Activations For Deep Learning. CoRR abs/1909.07729 (2019)
2018
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/0002MMKAB0VKGHD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0002MMKAB0VKGHD18
Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj D. Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesús Corbal, Nikita Shustrov, Roman Dubtsov, Evarist Fomenko, Vadim O. Pirogov:
Mixed Precision Training of Convolutional Neural Networks using Integer Operations. ICLR (Poster) 2018
[c12]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/sc/GeorganasABKHPH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/GeorganasABKHPH18
Evangelos Georganas, Sasikanth Avancha, Kunal Banerjee, Dhiraj D. Kalamkar, Greg Henry, Hans Pabst, Alexander Heinecke:
Anatomy of high-performance deep learning convolutions on SIMD architectures. SC 2018: 66:1-66:12
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1801-08030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-08030
Srinivas Sridharan, Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Dipankar Das, Mikhail E. Smorkalov, Mikhail Shiryaev, Dheevatsa Mudigere, Naveen Mellempudi, Sasikanth Avancha, Bharat Kaul, Pradeep Dubey:
On Scale-out Deep Learning Training for Cloud and HPC. CoRR abs/1801.08030 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1802-00930
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-00930
Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj D. Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesús Corbal, Nikita Shustrov, Roman Dubtsov, Evarist Fomenko, Vadim O. Pirogov:
Mixed Precision Training of Convolutional Neural Networks using Integer Operations. CoRR abs/1802.00930 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1808-05567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-05567
Evangelos Georganas, Sasikanth Avancha, Kunal Banerjee, Dhiraj D. Kalamkar, Greg Henry, Hans Pabst, Alexander Heinecke:
Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures. CoRR abs/1808.05567 (2018)
2016
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijhpca/ParkSVHKPPDLRMD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijhpca/ParkSVHKPPDLRMD16
Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Md. Mostofa Ali Patwary, Vadim O. Pirogov, Pradeep Dubey, Xing Liu, Carlos Rosales, Cyril Mazauric, Christopher S. Daley:
Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors. Int. J. High Perform. Comput. Appl. 30(1): 11-27 (2016)
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/supercomputer/JooKKVW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/supercomputer/JooKKVW16
Bálint Joó, Dhiraj D. Kalamkar, Thorsten Kurth, Karthikeyan Vaidyanathan, Aaron Walden:
Optimizing Wilson-Dirac Operator and Linear Solvers for Intel® KNL. ISC Workshops 2016: 415-427
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/0002AMVSKKD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/0002AMVSKKD16
Dipankar Das, Sasikanth Avancha, Dheevatsa Mudigere, Karthikeyan Vaidyanathan, Srinivas Sridharan, Dhiraj D. Kalamkar, Bharat Kaul, Pradeep Dubey:
Distributed Deep Learning Using Synchronous Stochastic Gradient Descent. CoRR abs/1602.06709 (2016)
2015
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/VaidyanathanKPH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/VaidyanathanKPH15
Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Kiran Pamnany, Jeff R. Hammond, Pavan Balaji, Dipankar Das, Jongsoo Park, Bálint Joó:
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading. SC 2015: 30:1-30:12
2014
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/ipps/VaidyanathanPKHSPKSKJD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ipps/VaidyanathanPKHSPKSKJD14
Karthikeyan Vaidyanathan, Kiran Pamnany, Dhiraj D. Kalamkar, Alexander Heinecke, Mikhail Smelyanskiy, Jongsoo Park, Daehyun Kim, Aniruddha G. Shet, Bharat Kaul, Bálint Joó, Pradeep Dubey:
Improving Communication Performance and Scalability of Native Applications on Intel Xeon Phi Coprocessor Clusters. IPDPS 2014: 1083-1092
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/HeybrockJKSVWD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/HeybrockJKSVWD14
Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, Pradeep Dubey:
Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors. SC 2014: 69-80
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/SridharanDK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/SridharanDK14
Srinivas Sridharan, James Dinan, Dhiraj D. Kalamkar:
Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints. SC 2014: 487-498
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/ParkSVHKLPLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/ParkSVHKLPLD14
Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey:
Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices. SC 2014: 945-955
2013
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/supercomputer/JooKVSPLDW13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/supercomputer/JooKVSPLDW13
Bálint Joó, Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Kiran Pamnany, Victor W. Lee, Pradeep Dubey, William A. Watson III:
Lattice QCD on Intel® Xeon PhiTM Coprocessors. ISC 2013: 40-54
2012
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/ipps/KalamkarTSSKMSBKD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ipps/KalamkarTSSKMSBKD12
Dhiraj D. Kalamkar, Joshua D. Trzasko, Srinivas Sridharan, Mikhail Smelyanskiy, Daehyun Kim, Armando Manduca, Yunhong Shu, Matt A. Bernstein, Bharat Kaul, Pradeep Dubey:
High Performance Non-uniform FFT on Modern X86-based Multi-core Systems. IPDPS 2012: 449-460
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/WilliamsKSDSSADSO12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/WilliamsKSDSSADSO12
Samuel Williams, Dhiraj D. Kalamkar, Amik Singh, Anand M. Deshpande, Brian van Straalen, Mikhail Smelyanskiy, Ann S. Almgren, Pradeep Dubey, John Shalf, Leonid Oliker:
Optimization of geometric multigrid for emerging multi- and manycore processors. SC 2012: 96
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/SmelyanskiySKSDABNMLKFG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/SmelyanskiySKSDABNMLKFG12
Mikhail Smelyanskiy, Jason Sewall, Dhiraj D. Kalamkar, Nadathur Satish, Pradeep Dubey, Nikita Astafiev, Ilya Burylov, Andrey Nikolaev, Sergey Maidanov, Shuo Li, Sunil Kulkarni, Charles H. Finan, Ekaterina Gonina:
Analysis and Optimization of Financial Analytics Benchmark on Modern Multi- and Many-core IA-Based Architectures. SC Companion 2012: 1154-1162

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2007
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ispass/KalamkarCH07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ispass/KalamkarCH07
Dhiraj D. Kalamkar, Mainak Chaudhuri, Mark A. Heinrich:
Simplifying Active Memory Clusters by Leveraging Directory Protocol Threads. ISPASS 2007: 242-253

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.