default search action

combined dblp search
author search
venue search
publication search

ask others

Sehoon Kim 0001

> Home > Persons

Person information

affiliation (PhD 2024): University of California, Berkeley, CA, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c16]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/HooperKMMZPMKG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HooperKMMZPMKG25
Coleman Richard Charles Hooper, Sehoon Kim, Hiva Mohammadzadeh, Monishwaran Maheswaran, Sebastian Zhao, June Paik, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
Squeezed Attention: Accelerating Long Context Length LLM Inference. ACL (1) 2025: 32631-32652
[c15]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ErdoganL0MFAKG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ErdoganL0MFAKG25
Lutfi Eren Erdogan, Nicholas Lee, Sehoon Kim, Suhong Moon, Hiroki Furuta, Gopala Anumanchipalli, Kurt Keutzer, Amir Gholami:
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks. ICML 2025
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/TiwariXTH0HNMKG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/TiwariXTH0HNMKG25
Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Richard Charles Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache. ICML 2025
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-10424
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-10424
Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache. CoRR abs/2502.10424 (2025)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-13575
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-13575
Coleman Hooper, Sehoon Kim, Suhong Moon, Kerem Dilmen, Monishwaran Maheswaran, Nicholas Lee, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami:
ETS: Efficient Tree Search for Inference-Time Scaling. CoRR abs/2502.13575 (2025)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-09572
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-09572
Lutfi Eren Erdogan, Nicholas Lee, Sehoon Kim, Suhong Moon, Hiroki Furuta, Gopala Anumanchipalli, Kurt Keutzer, Amir Gholami:
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks. CoRR abs/2503.09572 (2025)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13059
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13059
Coleman Hooper, Sebastian Zhao, Luca Manolache, Sehoon Kim, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami:
Multipole Attention for Efficient Long Context Reasoning. CoRR abs/2506.13059 (2025)
2024
[b1]
- view
- export record
  dblp key:
  - phd/us/Kim24b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/us/Kim24b
Sehoon Kim:
Full Stack Approach for Efficient Deep Learning Inference. University of California Berkeley, USA, 2024
[j4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/fdata/DeianaTABGDHHLNNOPABBBBMDDFGGGHHK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fdata/DeianaTABGDHHLNNOPABBBBMDDFGGGHHK24
Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier M. Duarte, Philip C. Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci-Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bähr, Jürgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomás E. Müller-Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Dongning Guo, Kyle J. Hazelwood, Christian Herwig, Babar Khan, Sehoon Kim, Thomas Klijnsma, Yaling Liu, Kin Ho Lo, Tri Nguyen, Gianantonio Pezzullo, Seyedramin Rasoulinezhad, Ryan A. Rivera, Kate Scholberg, Justin Selig, Sougata Sen, Dmitri Strukov, William Tang, Savannah Thais, Kai Lukas Unger, Ricardo Vilalta, Belinavon Krosigk, Shen Wang, Thomas K. Warburton:
Corrigendum: Applications and techniques for fast machine learning in science. Frontiers Big Data 6 (2024)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/micro/GholamiYKHMK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/micro/GholamiYKHMK24
Amir Gholami, Zhewei Yao, Sehoon Kim, Coleman Hooper, Michael W. Mahoney, Kurt Keutzer:
AI and Memory Wall. IEEE Micro 44(3): 33-39 (2024)
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LeeWKMSAMKG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LeeWKMSAMKG24
Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement. ACL (Findings) 2024: 6498-6526
[c12]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KimHGDLSMK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KimHGDLSMK24
Sehoon Kim, Coleman Hooper, Amir Gholami, Zhen Dong, Xiuyu Li, Sheng Shen, Michael W. Mahoney, Kurt Keutzer:
SqueezeLLM: Dense-and-Sparse Quantization. ICML 2024
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KimMTLMKG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KimMTLMKG24
Sehoon Kim, Suhong Moon, Ryan Tabrizi, Nicholas Lee, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
An LLM Compiler for Parallel Function Calling. ICML 2024
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HooperKMMSKG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HooperKMMSKG24
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami:
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization. NeurIPS 2024
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-07886
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-07886
Siddharth Jha, Coleman Hooper, Xiaoxuan Liu, Sehoon Kim, Kurt Keutzer:
Learned Best-Effort LLM Serving. CoRR abs/2401.07886 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-18079
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-18079
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami:
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization. CoRR abs/2401.18079 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-14123
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-14123
Amir Gholami, Zhewei Yao, Sehoon Kim, Coleman Hooper, Michael W. Mahoney, Kurt Keutzer:
AI and Memory Wall. CoRR abs/2403.14123 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-15042
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-15042
Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement. CoRR abs/2403.15042 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-08892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-08892
Siddharth Jha, Lutfi Eren Erdogan, Sehoon Kim, Kurt Keutzer, Amir Gholami:
Characterizing Prompt Compression Methods for Long Context Inference. CoRR abs/2407.08892 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00608
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00608
Lutfi Eren Erdogan, Nicholas Lee, Siddharth Jha, Sehoon Kim, Ryan Tabrizi, Suhong Moon, Coleman Hooper, Gopala Anumanchipalli, Kurt Keutzer, Amir Gholami:
TinyAgent: Function Calling at the Edge. CoRR abs/2409.00608 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-02141
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-02141
Suhong Moon, Siddharth Jha, Lutfi Eren Erdogan, Sehoon Kim, Woosang Lim, Kurt Keutzer, Amir Gholami:
Efficient and Scalable Estimation of Tool Representations in Vector Space. CoRR abs/2409.02141 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-09688
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-09688
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Monishwaran Maheswaran, June Paik, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
Squeezed Attention: Accelerating Long Context Length LLM Inference. CoRR abs/2411.09688 (2024)
2023
[c9]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KimMMMMGK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KimMMMMGK23
Sehoon Kim, Karttikeya Mangalam, Suhong Moon, Jitendra Malik, Michael W. Mahoney, Amir Gholami, Kurt Keutzer:
Speculative Decoding with Big Little Decoder. NeurIPS 2023
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-07863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-07863
Sehoon Kim, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Amir Gholami, Kurt Keutzer:
Big Little Transformer Decoder. CoRR abs/2302.07863 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14017
Sehoon Kim, Coleman Hooper, Thanakul Wattanawong, Minwoo Kang, Ruohan Yan, Hasan Genc, Grace Dinh, Qijing Huang, Kurt Keutzer, Michael W. Mahoney, Yakun Sophia Shao, Amir Gholami:
Full Stack Optimization of Transformer Inference: a Survey. CoRR abs/2302.14017 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-07629
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-07629
Sehoon Kim, Coleman Hooper, Amir Gholami, Zhen Dong, Xiuyu Li, Sheng Shen, Michael W. Mahoney, Kurt Keutzer:
SqueezeLLM: Dense-and-Sparse Quantization. CoRR abs/2306.07629 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-12072
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12072
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Hasan Genc, Kurt Keutzer, Amir Gholami, Yakun Sophia Shao:
SPEED: Speculative Pipelined Execution for Efficient Decoding. CoRR abs/2310.12072 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04511
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04511
Sehoon Kim, Suhong Moon, Ryan Tabrizi, Nicholas Lee, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
An LLM Compiler for Parallel Function Calling. CoRR abs/2312.04511 (2023)
2022
[j2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/fdata/DeianaTABGDHHLN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fdata/DeianaTABGDHHLN22
Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier M. Duarte, Philip C. Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bähr, Jürgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomás E. Müller-Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Dongning Guo, Kyle J. Hazelwood, Christian Herwig, Babar Khan, Sehoon Kim, Thomas Klijnsma, Yaling Liu, Kin Ho Lo, Tri Nguyen, Gianantonio Pezzullo, Seyedramin Rasoulinezhad, Ryan A. Rivera, Kate Scholberg, Justin Selig, Sougata Sen, Dmitri Strukov, William Tang, Savannah Thais, Kai Lukas Unger, Ricardo Vilalta, Belinavon Krosigk, Shen Wang, Thomas K. Warburton:
Applications and Techniques for Fast Machine Learning in Science. Frontiers Big Data 5: 787421 (2022)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimGYLWNZGMK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimGYLWNZGMK22
Sehoon Kim, Amir Gholami, Zhewei Yao, Nicholas Lee, Patrick Wang, Aniruddha Nrusimha, Bohan Zhai, Tianren Gao, Michael W. Mahoney, Kurt Keutzer:
Integer-Only Zero-Shot Quantization for Efficient Speech Recognition. ICASSP 2022: 4288-4292
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/KimSTGKHK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/KimSTGKHK22
Sehoon Kim, Sheng Shen, David Thorsley, Amir Gholami, Woosuk Kwon, Joseph Hassoun, Kurt Keutzer:
Learned Token Pruning for Transformers. KDD 2022: 784-794
[c6]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KimGSLMMMK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KimGSLMMMK22
Sehoon Kim, Amir Gholami, Albert E. Shaw, Nicholas Lee, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Kurt Keutzer:
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition. NeurIPS 2022
[c5]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KwonKMHKG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KwonKMHKG22
Woosuk Kwon, Sehoon Kim, Michael W. Mahoney, Joseph Hassoun, Kurt Keutzer, Amir Gholami:
A Fast Post-Training Pruning Framework for Transformers. NeurIPS 2022
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/YuYGDKMK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/YuYGDKMK22
Shixing Yu, Zhewei Yao, Amir Gholami, Zhen Dong, Sehoon Kim, Michael W. Mahoney, Kurt Keutzer:
Hessian-Aware Pruning and Optimal Neural Implant. WACV 2022: 3665-3676
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-09210
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-09210
Taebum Kim, Eunji Jeong, Geon-Woo Kim, Yunmo Koo, Sehoon Kim, Gyeong-In Yu, Byung-Gon Chun:
Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs. CoRR abs/2201.09210 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-09656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-09656
Woosuk Kwon, Sehoon Kim, Michael W. Mahoney, Joseph Hassoun, Kurt Keutzer, Amir Gholami:
A Fast Post-Training Pruning Framework for Transformers. CoRR abs/2204.09656 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-00888
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00888
Sehoon Kim, Amir Gholami, Albert E. Shaw, Nicholas Lee, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Kurt Keutzer:
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition. CoRR abs/2206.00888 (2022)
2021
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/pvldb/YuAKPZCWI21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pvldb/YuAKPZCWI21
Gyeong-In Yu, Saeed Amizadeh, Sehoon Kim, Artidoro Pagnoni, Ce Zhang, Byung-Gon Chun, Markus Weimer, Matteo Interlandi:
WindTunnel: Towards Differentiable ML Pipelines Beyond a Single Modele. Proc. VLDB Endow. 15(1): 11-20 (2021)
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KimGYMK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KimGYMK21
Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer:
I-BERT: Integer-only BERT Quantization. ICML 2021: 5506-5518
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/ispass/XuKNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ispass/XuKNS21
Jingyi Xu, Sehoon Kim, Borivoje Nikolic, Yakun Sophia Shao:
Memory-Efficient Hardware Performance Counters with Approximate-Counting Algorithms. ISPASS 2021: 226-228
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KimJKKKYC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KimJKKKYC21
Taebum Kim, Eunji Jeong, Geon-Woo Kim, Yunmo Koo, Sehoon Kim, Gyeong-In Yu, Byung-Gon Chun:
Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs. NeurIPS 2021: 1468-1480
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-01321
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-01321
Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer:
I-BERT: Integer-only BERT Quantization. CoRR abs/2101.01321 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-13630
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-13630
Amir Gholami, Sehoon Kim, Zhen Dong, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer:
A Survey of Quantization Methods for Efficient Neural Network Inference. CoRR abs/2103.13630 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-16827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-16827
Sehoon Kim, Amir Gholami, Zhewei Yao, Aniruddha Nrusimha, Bohan Zhai, Tianren Gao, Michael W. Mahoney, Kurt Keutzer:
Q-ASR: Integer-only Zero-shot Quantization for Efficient Speech Recognition. CoRR abs/2103.16827 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00910
Sehoon Kim, Sheng Shen, David Thorsley, Amir Gholami, Joseph Hassoun, Kurt Keutzer:
Learned Token Pruning for Transformers. CoRR abs/2107.00910 (2021)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13041
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13041
Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier M. Duarte, Philip C. Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bähr, Jürgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomás E. Müller-Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Kyle J. Hazelwood, Christian Herwig, Babar Khan, Sehoon Kim, Thomas Klijnsma, Yaling Liu, Kin Ho Lo, Tri Nguyen, Gianantonio Pezzullo, Seyedramin Rasoulinezhad, Ryan A. Rivera, Kate Scholberg, Justin Selig, Sougata Sen, Dmitri Strukov, William Tang, Savannah Thais, Kai Lukas Unger, Ricardo Vilalta, Belinavon Krosigk, Thomas K. Warburton, Maria Acosta Flechas, Anthony Aportela, Thomas Calvet, Leonardo Cristella, Daniel Diaz, Caterina Doglioni, Maria Domenica Galati, Elham E Khoda, Farah Fahim, Davide Giri, Benjamin Hawks, Duc Hoang, Burt Holzman, Shih-Chieh Hsu, Sergo Jindariani, Iris Johnson, Raghav Kansal, Ryan Kastner, Erik Katsavounidis, Jeffrey D. Krupa, Pan Li, Sandeep Madireddy, Ethan Marx, Patrick McCormack, Andres Meza, Jovan Mitrevski, Mohammed Attia Mohammed, Farouk Mokhtar, Eric A. Moreno, Srishti Nagu, Rohin Narayan, Noah Palladino, Zhiqiang Que, Sang Eon Park, Subramanian Ramamoorthy, Dylan S. Rankin, Simon Rothman, Ashish Sharma, Sioni Summers, Pietro Vischia, Jean-Roch Vlimant, Olivia Weng:
Applications and Techniques for Fast Machine Learning in Science. CoRR abs/2110.13041 (2021)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.