


default search action
Xianwei Zhang 0001
Person information
- affiliation: Sun Yat-sen University, School of Computer Science and Engineering, Guangzhou, China
- affiliation: AMD Inc., Sunnyvale, CA, USA
- affiliation: University of Pittsburgh, Computer Science Department, Pittsburgh, PA, USA
Other persons with the same name
- Xianwei Zhang (aka: XianWei Zhang, Xian-Wei Zhang) — disambiguation page
- Xianwei Zhang 0002
— East China Jiaotong University, School of Electrical and Automation Engineering, Nanchang, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j3]Hengzhong Liang, Han Huang, Xianwei Zhang
:
SuCL: supply unified communication layer to improve SYCL-based heterogeneous computing. CCF Trans. High Perform. Comput. 7(3): 211-225 (2025)
[c32]Mengyue Xi
, Tianyu Guo
, Xuanteng Huang
, Zejia Lin
, Xianwei Zhang
:
Mpache: Interaction Aware Multi-level Cache Bypassing on GPUs. ASP-DAC 2025: 1209-1215
[c31]Xuanteng Huang, Jiangsu Du, Nong Xiao, XianWei Zhang:
PASK: Cold Start Mitigation for Inference with Proactive and Selective Kernel Loading on GPUs. DAC 2025: 1-7
[c30]Kan Wu, Zejia Lin, Mengyue Xi, Zhongchun Zheng
, Wenxuan Pan, Xianwei Zhang, Yutong Lu:
GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving. DAC 2025: 1-7
[c29]Mengyue Xi
, Jingyi He, Xianwei Zhang
:
CacheC: LLM-Based GPU Cache Management to Enhance Kernel Concurrency. Euro-Par (2) 2025: 118-131
[c28]Tianyu Guo
, Hande Dong, Yichong Leng, Feng Liu, Cheater Lin, Nong Xiao
, Xianwei Zhang
:
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse. Euro-Par (2) 2025: 335-348
[c27]Yuhao Gu
, Haoquan Chen
, Xianjie Chen
, Jiangsu Du
, Zhiguang Chen
, Nong Xiao
, Xianwei Zhang
, Yutong Lu
:
coMtainer: Compilation-assisted HPC Container Images with Enhanced Adaptability. SC 2025: 586-601
[c26]Tianyu Guo
, Xianwei Zhang
, Jiangsu Du
, Zhiguang Chen
, Nong Xiao
, Yutong Lu
:
gLLM: Global Balanced Pipeline Parallelism Systems for Distributed LLMs Serving with Token Throttling. SC 2025: 1725-1741
[c25]Han Huang
, Jiabin Xie
, Guangnan Feng
, Xianwei Zhang
, Dan Huang
, Zhiguang Chen
, Yutong Lu
:
HStencil: Matrix-Vector Stencil Computation with Interleaved Outer Product and MLA. SC 2025: 1816-1829
[c24]Yuhao Gu, Chunyu Chen, Jiangsu Du
, Xiaoxi Zhang, Xianwei Zhang:
ORFA: Exploring WebAssembly as a Turing Complete Query Language for Web APIs. WWW 2025: 1856-1865
[d1]Tianyu Guo
, Hande Dong, Yichong Leng, Feng Liu, Cheater Lin, Nong Xiao
, Xianwei Zhang
:
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse. Zenodo, 2025
[i5]Tianyu Guo, Xianwei Zhang, Jiangsu Du, Zhiguang Chen, Nong Xiao, Yutong Lu:
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling. CoRR abs/2504.14775 (2025)
[i4]Zejia Lin, Hongxin Xu, Guanyi Chen, Xianwei Zhang, Yutong Lu:
Bullet: Boosting GPU Utilization for LLM Serving via Dynamic Spatial-Temporal Orchestration. CoRR abs/2504.19516 (2025)
[i3]Tianyu Guo, Hande Dong, Yichong Leng, Feng Liu, Cheater Lin, Nong Xiao, Xianwei Zhang:
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse. CoRR abs/2505.21889 (2025)
[i2]Yuhao Gu, Zhongchun Zheng, Nong Xiao, Yutong Lu, Xianwei Zhang:
Partial Cross-Compilation and Mixed Execution for Accelerating Dynamic Binary Translation. CoRR abs/2512.00487 (2025)- 2024
[c23]Tianyu Guo
, Xuanteng Huang
, Kan Wu
, Xianwei Zhang
, Nong Xiao
:
SMILE: LLC-based Shared Memory Expansion to Improve GPU Thread Level Parallelism. DAC 2024: 45:1-45:6
[c22]Zhaowen Shan, Xuanteng Huang, Zheng Zhou, Xianwei Zhang:
openLG: A Tunable and Efficient Open-source LSTM on GPUs. IJCNN 2024: 1-8
[c21]Zejia Lin
, Aoyuan Sun
, Xianwei Zhang
, Yutong Lu
:
MixPert: Optimizing Mixed-Precision Floating-Point Emulation on GPU Integer Tensor Cores. LCTES 2024: 34-45
[c20]Zhongchun Zheng, Yuan Wu, Xianwei Zhang:
mLOOP: Optimize Loop Unrolling in Compilation with a ML-based Approach. NAS 2024: 1-8
[c19]Yuanxin Wei, Jiangsu Du, Jiazhi Jiang, Xiao Shi, Xianwei Zhang, Dan Huang, Nong Xiao, Yutong Lu:
APTMoE: Affinity-Aware Pipeline Tuning for MoE Models on Bandwidth-Constrained GPU Nodes. SC 2024: 90- 2023
[j2]Xi Zhang
, Xiaohu Guo, Yue Weng, Xianwei Zhang, Yutong Lu, Zhong Zhao:
Hybrid MPI and CUDA paralleled finite volume unstructured CFD simulations on a multi-GPU system. Future Gener. Comput. Syst. 139: 1-16 (2023)
[c18]Zejia Lin
, Zewei Mo, Xuanteng Huang, Xianwei Zhang, Yutong Lu:
KeSCo: Compiler-based Kernel Scheduling for Multi-task GPU Applications. ICCD 2023: 247-254
[c17]Lianghong Huang, Zejia Lin, Wei Liu
, Xianwei Zhang:
Hay: Enhancing GPU Sharing Performance With Two-Level Scheduling for Ray. ICPADS 2023: 2865-2868- 2022
[c16]Yue Weng, Tianao Ge
, Xi Zhang, Xianwei Zhang, Yutong Lu:
RAISE: Efficient GPU Resource Management via Hybrid Scheduling. CCGRID 2022: 685-695
[c15]Zewei Mo, Zejia Lin
, Xianwei Zhang, Yutong Lu:
moTuner: a compiler-based auto-tuning approach for mixed-precision operators. CF 2022: 94-102
[c14]Tianao Ge
, Zewei Mo, Kan Wu, Xianwei Zhang, Yutong Lu:
RollBin: reducing code-size via loop rerolling at binary level. LCTES 2022: 99-110- 2020
[c13]Xianwei Zhang, Evgeny Shcherbakov:
DELTA: Validate GPU Memory Profiling with Microbenchmarks. MEMSYS 2020: 97-104
2010 – 2019
- 2019
[c12]Xianwei Zhang, Rujia Wang, Youtao Zhang, Jun Yang:
Boosting chipkill capability under retention-error induced reliability emergency. ASP-DAC 2019: 400-405
[c11]Tuan Ta, Xianwei Zhang, Anthony Gutierrez
, Bradford M. Beckmann:
Autonomous Data-Race-Free GPU Testing. IISWC 2019: 81-92
[c10]Johnathan Alsop, Xianwei Zhang, Tsung Tai Yeh, Bradford M. Beckmann, Matthew D. Sinclair, Srikant Bharadwaj, Alexandru Dutu, Anthony Gutierrez
, Onur Kayiran, Michael LeBeane, Brandon Potter, Sooraj Puthoor:
Optimizing GPU Cache Policies for MI Workloads. IISWC 2019: 243-248
[i1]Johnathan Alsop, Matthew D. Sinclair, Srikant Bharadwaj, Alexandru Dutu, Anthony Gutierrez, Onur Kayiran, Michael LeBeane, Sooraj Puthoor, Xianwei Zhang, Tsung Tai Yeh, Bradford M. Beckmann:
Optimizing GPU Cache Policies for MI Workloads. CoRR abs/1910.00134 (2019)- 2018
[c9]Anthony Gutierrez
, Bradford M. Beckmann, Alexandru Dutu, Joseph Gross, Michael LeBeane, John Kalamatianos, Onur Kayiran, Matthew Poremba, Brandon Potter, Sooraj Puthoor, Matthew D. Sinclair, Mark Wyse
, Jieming Yin, Xianwei Zhang, Akshay Jain, Timothy G. Rogers
:
Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level. HPCA 2018: 608-619- 2017
[j1]XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
On the Restore Time Variations of Future DRAM Memory. ACM Trans. Design Autom. Electr. Syst. 22(2): 26:1-26:24 (2017)
[c8]XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
DrMP: Mixed Precision-Aware DRAM for High Performance Approximate and Precise Computing. PACT 2017: 53-63- 2016
[c7]XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
Restore truncation for performance improvement in future DRAM systems. HPCA 2016: 543-554
[c6]XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
AWARD: Approximation-aWAre Restore in Further Scaling DRAM. MEMSYS 2016: 322-324- 2015
[c5]XianWei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang:
Exploiting DRAM restore time variations in deep sub-micron scaling. DATE 2015: 477-482
[c4]XianWei Zhang, Youtao Zhang, Jun Yang:
DLB: Dynamic lane borrowing for improving bandwidth and performance in Hybrid Memory Cube. ICCD 2015: 125-132
[c3]XianWei Zhang, Lei Zhao, Youtao Zhang, Jun Yang:
Exploit common source-line to construct energy efficient domain wall memory based caches. ICCD 2015: 157-163
[c2]XianWei Zhang, Youtao Zhang, Jun Yang:
TriState-SET: Proactive SET for improved performance of MLC phase change memories. ICCD 2015: 659-665- 2013
[c1]XianWei Zhang, Le Jang, Youtao Zhang, Chuanjun Zhang, Jun Yang:
WoM-SET: Low power proactive-SET-based PCM write using WoM code. ISLPED 2013: 217-222
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-01-18 23:20 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID






