default search action

combined dblp search
author search
venue search
publication search

ask others

Xianwei Zhang 0001

XianWei Zhang 0001

> Home > Persons

Person information

affiliation: Sun Yat-sen University, School of Computer Science and Engineering, Guangzhou, China
affiliation: AMD Inc., Sunnyvale, CA, USA
affiliation: University of Pittsburgh, Computer Science Department, Pittsburgh, PA, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ccfthpc/LiangHZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ccfthpc/LiangHZ25
Hengzhong Liang, Han Huang, Xianwei Zhang:
SuCL: supply unified communication layer to improve SYCL-based heterogeneous computing. CCF Trans. High Perform. Comput. 7(3): 211-225 (2025)
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/aspdac/Xi0H0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aspdac/Xi0H0025
Mengyue Xi, Tianyu Guo, Xuanteng Huang, Zejia Lin, Xianwei Zhang:
Mpache: Interaction Aware Multi-level Cache Bypassing on GPUs. ASP-DAC 2025: 1209-1215
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/dac/HuangDXZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/HuangDXZ25
Xuanteng Huang, Jiangsu Du, Nong Xiao, XianWei Zhang:
PASK: Cold Start Mitigation for Inference with Proactive and Selective Kernel Loading on GPUs. DAC 2025: 1-7
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/dac/WuLXZPZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/WuLXZPZL25
Kan Wu, Zejia Lin, Mengyue Xi, Zhongchun Zheng, Wenxuan Pan, Xianwei Zhang, Yutong Lu:
GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving. DAC 2025: 1-7
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/europar/XiHZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/europar/XiHZ25
Mengyue Xi, Jingyi He, Xianwei Zhang:
CacheC: LLM-Based GPU Cache Management to Enhance Kernel Concurrency. Euro-Par (2) 2025: 118-131
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/europar/GuoDLLLXZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/europar/GuoDLLLXZ25
Tianyu Guo, Hande Dong, Yichong Leng, Feng Liu, Cheater Lin, Nong Xiao, Xianwei Zhang:
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse. Euro-Par (2) 2025: 335-348
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/GuCCD000L25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/GuCCD000L25
Yuhao Gu, Haoquan Chen, Xianjie Chen, Jiangsu Du, Zhiguang Chen, Nong Xiao, Xianwei Zhang, Yutong Lu:
coMtainer: Compilation-assisted HPC Container Images with Enhanced Adaptability. SC 2025: 586-601
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/00090D00L25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/00090D00L25
Tianyu Guo, Xianwei Zhang, Jiangsu Du, Zhiguang Chen, Nong Xiao, Yutong Lu:
gLLM: Global Balanced Pipeline Parallelism Systems for Distributed LLMs Serving with Token Throttling. SC 2025: 1725-1741
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/HuangXF000L25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/HuangXF000L25
Han Huang, Jiabin Xie, Guangnan Feng, Xianwei Zhang, Dan Huang, Zhiguang Chen, Yutong Lu:
HStencil: Matrix-Vector Stencil Computation with Interleaved Outer Product and MLA. SC 2025: 1816-1829
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/www/GuCDZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/GuCDZ025
Yuhao Gu, Chunyu Chen, Jiangsu Du, Xiaoxi Zhang, Xianwei Zhang:
ORFA: Exploring WebAssembly as a Turing Complete Query Language for Web APIs. WWW 2025: 1856-1865
[d1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/11/GuoDLLLXZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/GuoDLLLXZ25
Tianyu Guo, Hande Dong, Yichong Leng, Feng Liu, Cheater Lin, Nong Xiao, Xianwei Zhang:
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse. Zenodo, 2025
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-14775
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-14775
Tianyu Guo, Xianwei Zhang, Jiangsu Du, Zhiguang Chen, Nong Xiao, Yutong Lu:
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling. CoRR abs/2504.14775 (2025)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-19516
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-19516
Zejia Lin, Hongxin Xu, Guanyi Chen, Xianwei Zhang, Yutong Lu:
Bullet: Boosting GPU Utilization for LLM Serving via Dynamic Spatial-Temporal Orchestration. CoRR abs/2504.19516 (2025)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-21889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-21889
Tianyu Guo, Hande Dong, Yichong Leng, Feng Liu, Cheater Lin, Nong Xiao, Xianwei Zhang:
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse. CoRR abs/2505.21889 (2025)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-00487
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-00487
Yuhao Gu, Zhongchun Zheng, Nong Xiao, Yutong Lu, Xianwei Zhang:
Partial Cross-Compilation and Mixed Execution for Accelerating Dynamic Binary Translation. CoRR abs/2512.00487 (2025)
2024
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/dac/GuoHWZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/GuoHWZX24
Tianyu Guo, Xuanteng Huang, Kan Wu, Xianwei Zhang, Nong Xiao:
SMILE: LLC-based Shared Memory Expansion to Improve GPU Thread Level Parallelism. DAC 2024: 45:1-45:6
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/ShanHZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/ShanHZZ24
Zhaowen Shan, Xuanteng Huang, Zheng Zhou, Xianwei Zhang:
openLG: A Tunable and Efficient Open-source LSTM on GPUs. IJCNN 2024: 1-8
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/lctrts/LinSZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lctrts/LinSZL24
Zejia Lin, Aoyuan Sun, Xianwei Zhang, Yutong Lu:
MixPert: Optimizing Mixed-Precision Floating-Point Emulation on GPU Integer Tensor Cores. LCTES 2024: 34-45
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/nas/ZhengWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nas/ZhengWZ24
Zhongchun Zheng, Yuan Wu, Xianwei Zhang:
mLOOP: Optimize Loop Unrolling in Compilation with a ML-based Approach. NAS 2024: 1-8
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/WeiDJSZHXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/WeiDJSZHXL24
Yuanxin Wei, Jiangsu Du, Jiazhi Jiang, Xiao Shi, Xianwei Zhang, Dan Huang, Nong Xiao, Yutong Lu:
APTMoE: Affinity-Aware Pipeline Tuning for MoE Models on Bandwidth-Constrained GPU Nodes. SC 2024: 90
2023
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/fgcs/ZhangGWZLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fgcs/ZhangGWZLZ23
Xi Zhang, Xiaohu Guo, Yue Weng, Xianwei Zhang, Yutong Lu, Zhong Zhao:
Hybrid MPI and CUDA paralleled finite volume unstructured CFD simulations on a multi-GPU system. Future Gener. Comput. Syst. 139: 1-16 (2023)
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/iccd/LinMHZL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccd/LinMHZL23
Zejia Lin, Zewei Mo, Xuanteng Huang, Xianwei Zhang, Yutong Lu:
KeSCo: Compiler-based Kernel Scheduling for Multi-task GPU Applications. ICCD 2023: 247-254
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icpads/HuangLLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpads/HuangLLZ23
Lianghong Huang, Zejia Lin, Wei Liu, Xianwei Zhang:
Hay: Enhancing GPU Sharing Performance With Two-Level Scheduling for Ray. ICPADS 2023: 2865-2868
2022
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/ccgrid/WengGZZL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ccgrid/WengGZZL22
Yue Weng, Tianao Ge, Xi Zhang, Xianwei Zhang, Yutong Lu:
RAISE: Efficient GPU Resource Management via Hybrid Scheduling. CCGRID 2022: 685-695
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/cf/MoLZL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cf/MoLZL22
Zewei Mo, Zejia Lin, Xianwei Zhang, Yutong Lu:
moTuner: a compiler-based auto-tuning approach for mixed-precision operators. CF 2022: 94-102
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/lctrts/GeMWZL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lctrts/GeMWZL22
Tianao Ge, Zewei Mo, Kan Wu, Xianwei Zhang, Yutong Lu:
RollBin: reducing code-size via loop rerolling at binary level. LCTES 2022: 99-110
2020
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/memsys/ZhangS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/memsys/ZhangS20
Xianwei Zhang, Evgeny Shcherbakov:
DELTA: Validate GPU Memory Profiling with Microbenchmarks. MEMSYS 2020: 97-104

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/aspdac/ZhangWZY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aspdac/ZhangWZY19
Xianwei Zhang, Rujia Wang, Youtao Zhang, Jun Yang:
Boosting chipkill capability under retention-error induced reliability emergency. ASP-DAC 2019: 400-405
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iiswc/TaZGB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iiswc/TaZGB19
Tuan Ta, Xianwei Zhang, Anthony Gutierrez, Bradford M. Beckmann:
Autonomous Data-Race-Free GPU Testing. IISWC 2019: 81-92
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/iiswc/AlsopZYBSBDGKLP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iiswc/AlsopZYBSBDGKLP19
Johnathan Alsop, Xianwei Zhang, Tsung Tai Yeh, Bradford M. Beckmann, Matthew D. Sinclair, Srikant Bharadwaj, Alexandru Dutu, Anthony Gutierrez, Onur Kayiran, Michael LeBeane, Brandon Potter, Sooraj Puthoor:
Optimizing GPU Cache Policies for MI Workloads. IISWC 2019: 243-248
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-00134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-00134
Johnathan Alsop, Matthew D. Sinclair, Srikant Bharadwaj, Alexandru Dutu, Anthony Gutierrez, Onur Kayiran, Michael LeBeane, Sooraj Puthoor, Xianwei Zhang, Tsung Tai Yeh, Bradford M. Beckmann:
Optimizing GPU Cache Policies for MI Workloads. CoRR abs/1910.00134 (2019)
2018
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/GutierrezBDGLKK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/GutierrezBDGLKK18
Anthony Gutierrez, Bradford M. Beckmann, Alexandru Dutu, Joseph Gross, Michael LeBeane, John Kalamatianos, Onur Kayiran, Matthew Poremba, Brandon Potter, Sooraj Puthoor, Matthew D. Sinclair, Mark Wyse, Jieming Yin, Xianwei Zhang, Akshay Jain, Timothy G. Rogers:
Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level. HPCA 2018: 608-619
2017
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/todaes/ZhangZCY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/todaes/ZhangZCY17
XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
On the Restore Time Variations of Future DRAM Memory. ACM Trans. Design Autom. Electr. Syst. 22(2): 26:1-26:24 (2017)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/ZhangZCY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/ZhangZCY17
XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
DrMP: Mixed Precision-Aware DRAM for High Performance Approximate and Precise Computing. PACT 2017: 53-63
2016
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/ZhangZCY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/ZhangZCY16
XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
Restore truncation for performance improvement in future DRAM systems. HPCA 2016: 543-554
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/memsys/ZhangZCY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/memsys/ZhangZCY16
XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
AWARD: Approximation-aWAre Restore in Further Scaling DRAM. MEMSYS 2016: 322-324
2015
[c5]
- view
- export record
  dblp key:
  - conf/date/ZhangZCY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/date/ZhangZCY15
XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
Exploiting DRAM restore time variations in deep sub-micron scaling. DATE 2015: 477-482
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iccd/ZhangZY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccd/ZhangZY15
XianWei Zhang, Youtao Zhang, Jun Yang:
DLB: Dynamic lane borrowing for improving bandwidth and performance in Hybrid Memory Cube. ICCD 2015: 125-132
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/iccd/ZhangZZY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccd/ZhangZZY15
XianWei Zhang, Lei Zhao, Youtao Zhang, Jun Yang:
Exploit common source-line to construct energy efficient domain wall memory based caches. ICCD 2015: 157-163
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iccd/ZhangZY15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccd/ZhangZY15a
XianWei Zhang, Youtao Zhang, Jun Yang:
TriState-SET: Proactive SET for improved performance of MLC phase change memories. ICCD 2015: 659-665
2013
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/islped/ZhangJZZ013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/islped/ZhangJZZ013
XianWei Zhang, Le Jang, Youtao Zhang, Chuanjun Zhang, Jun Yang:
WoM-SET: Low power proactive-SET-based PCM write using WoM code. ISLPED 2013: 217-222

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.