


default search action
Yiren Song
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j2]Benhong Zhang
, Yiren Song, Yidong Zhang, Xiang Bi:
Two-dimensional normalized knowledge distillation leveraging class relations. J. Vis. Commun. Image Represent. 112: 104557 (2025)
[c13]Yuxuan Zhang, Qing Zhang, Yiren Song, Jichao Zhang, Hao Tang, Jiaming Liu:
Stable-Hair: Real-World Hair Transfer via Diffusion Model. AAAI 2025: 10348-10356
[c12]Yiren Song, Pei Yang, Hai Ci, Mike Zheng Shou:
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation. CVPR 2025: 3019-3028
[c11]Yepeng Liu, Yiren Song, Hai Ci, Yu Zhang, Haofan Wang, Mike Zheng Shou, Yuheng Bu:
Image Watermarks are Removable using Controllable Regeneration from Clean Noise. ICLR 2025
[c10]Hai Ci, Yiren Song, Pei Yang, Jinheng Xie, Mike Zheng Shou:
WMAdapter: Adding WaterMark Control to Latent Diffusion Models. ICML 2025
[c9]Yuxuan Zhang, Yirui Yuan, Yiren Song, Jiaming Liu:
StableMakeup: When Real-World Makeup Transfer Meets Diffusion Model. SIGGRAPH (Conference Paper Track) 2025: 68:1-68:9
[i42]Hailong Guo, Bohan Zeng, Yiren Song, Wentao Zhang, Chuang Zhang, Jiaming Liu:
Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks. CoRR abs/2501.15891 (2025)
[i41]Yiren Song, Danze Chen, Mike Zheng Shou:
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer. CoRR abs/2502.01105 (2025)
[i40]Yiren Song, Cheng Liu, Mike Zheng Shou:
MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation. CoRR abs/2502.01572 (2025)
[i39]Shijie Huang, Yiren Song, Yuxuan Zhang, Hailong Guo, Xueyin Wang, Mike Zheng Shou, Jiaming Liu:
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data. CoRR abs/2502.14397 (2025)
[i38]Yuxuan Zhang, Yirui Yuan, Yiren Song, Haofan Wang, Jiaming Liu:
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer. CoRR abs/2503.07027 (2025)
[i37]Xuewei Chen, Zhimin Chen, Yiren Song:
TransAnimate: Taming Layer Diffusion to Generate RGBA Video. CoRR abs/2503.17934 (2025)
[i36]Xiaojun Ye, Chun Wang, Yiren Song, Sheng Zhou, Liangcheng Li, Jiajun Bu:
FocusedAD: Character-centric Movie Audio Description. CoRR abs/2504.12157 (2025)
[i35]Yiren Song, Cheng Liu, Mike Zheng Shou:
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data. CoRR abs/2505.18445 (2025)
[i34]Chun Wang, Xiaoran Pan, Zihao Pan, Haofan Wang, Yiren Song:
GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains. CoRR abs/2505.18700 (2025)
[i33]Zitong Wang, Hang Zhao, Qianyu Zhou, Xuequan Lu, Xiangtai Li, Yiren Song:
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers. CoRR abs/2505.21541 (2025)
[i32]Runnan Lu, Yuxuan Zhang, Jiaming Liu, Haofan Wang, Yiren Song:
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering. CoRR abs/2505.24417 (2025)
[i31]Siqi Hui, Yiren Song, Sanping Zhou, Ye Deng, Wenli Huang, Jinjun Wang:
Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack. CoRR abs/2506.01011 (2025)
[i30]Yan Gong, Yiren Song, Yicheng Li, Chenglin Li, Yin Zhang:
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers. CoRR abs/2506.02528 (2025)
[i29]Zonglin Wu, Yule Xue, Xin Wei, Yiren Song:
MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks. CoRR abs/2506.05982 (2025)
[i28]Wenda Shi, Yiren Song, Zihan Rao, Dengming Zhang, Jiaming Liu, Xingxing Zou:
WordCon: Word-level Typography Control in Scene Text Rendering. CoRR abs/2506.21276 (2025)
[i27]Yuxin Jiang, Yuchao Gu, Yiren Song, Ivor W. Tsang, Mike Zheng Shou:
Personalized Vision via Visual In-Context Learning. CoRR abs/2509.25172 (2025)
[i26]Yiqing Shi, Yiren Song, Mike Zheng Shou:
Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers. CoRR abs/2511.18673 (2025)
[i25]Yaoli Liu, Ziheng Ouyang, Shengtao Lou, Yiren Song:
OmniRefiner: Reinforcement-Guided Local Diffusion Refinement. CoRR abs/2511.19990 (2025)
[i24]Ziheng Ouyang, Yiren Song, Yaoli Liu, Shihao Zhu, Qibin Hou, Ming-Ming Cheng, Mike Zheng Shou:
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment. CoRR abs/2511.20614 (2025)
[i23]Quanjian Song, Yiren Song, Kelly Peng, Yuan Gao, Mike Zheng Shou:
WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation. CoRR abs/2511.22098 (2025)
[i22]Pei Yang, Yepeng Liu, Kelly Peng, Yuan Gao, Yiren Song:
TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance. CoRR abs/2512.01314 (2025)
[i21]Pei Yang, Hai Ci, Yiren Song, Mike Zheng Shou:
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale. CoRR abs/2512.04537 (2025)
[i20]Cheng Liu, Yiren Song, Haofan Wang, Mike Zheng Shou:
OmniPSD: Layered PSD Generation with Diffusion Transformer. CoRR abs/2512.09247 (2025)
[i19]Hai Ci, Xiaokang Liu, Pei Yang, Yiren Song, Mike Zheng Shou:
H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos. CoRR abs/2512.09406 (2025)
[i18]Yuanhang Li, Yiren Song, Junzhe Bai, Xinran Liang, Hu Yang, Libiao Jin, Qi Mao:
IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning. CoRR abs/2512.15635 (2025)
[i17]Yiren Song, Cheng Liu, Weijia Mao, Mike Zheng Shou:
Mitty: Diffusion-based Human-to-Robot Video Generation. CoRR abs/2512.17253 (2025)
[i16]Mingcheng Ye, Jiaming Liu, Yiren Song:
Loom: Diffusion-Transformer for Interleaved Generation. CoRR abs/2512.18254 (2025)- 2024
[c8]Yuxuan Zhang, Yiren Song, Jiaming Liu, Rui Wang, Jinpeng Yu, Hao Tang, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing:
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation. CVPR 2024: 8069-8078
[c7]Hai Ci
, Pei Yang, Yiren Song
, Mike Zheng Shou
:
RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-key Identification. ECCV (28) 2024: 338-354
[c6]Yuxuan Zhang, Yiren Song, Jinpeng Yu, Han Pan, Zhongliang Jing:
Fast Personalized Text to Image Synthesis with Attention Injection. ICASSP 2024: 6195-6199
[c5]Pei Yang, Hai Ci, Yiren Song, Mike Zheng Shou:
Can Simple Averaging Defeat Modern Watermarks? NeurIPS 2024
[c4]Yiren Song
, Shijie Huang
, Chen Yao
, Hai Ci
, Xiaojun Ye
, Jiaming Liu
, Yuxuan Zhang
, Mike Zheng Shou
:
ProcessPainter: Learning to draw from sequence data. SIGGRAPH Asia 2024: 18:1-18:10
[i15]Yuxuan Zhang, Lifu Wei, Qing Zhang, Yiren Song, Jiaming Liu, Huaxia Li, Xu Tang, Yao Hu, Haibo Zhao:
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model. CoRR abs/2403.07764 (2024)
[i14]Yuxuan Zhang, Yiren Song, Jinpeng Yu, Han Pan, Zhongliang Jing:
Fast Personalized Text-to-Image Syntheses With Attention Injection. CoRR abs/2403.11284 (2024)
[i13]Hai Ci, Pei Yang, Yiren Song, Mike Zheng Shou:
RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification. CoRR abs/2404.14055 (2024)
[i12]Yiren Song, Shijie Huang, Chen Yao, Xiaojun Ye, Hai Ci, Jiaming Liu, Yuxuan Zhang, Mike Zheng Shou:
ProcessPainter: Learn Painting Process from Sequence Data. CoRR abs/2406.06062 (2024)
[i11]Hai Ci, Yiren Song, Pei Yang, Jinheng Xie, Mike Zheng Shou:
WMAdapter: Adding WaterMark Control to Latent Diffusion Models. CoRR abs/2406.08337 (2024)
[i10]Pei Yang, Hai Ci, Yiren Song, Mike Zheng Shou:
Steganalysis on Digital Watermarking: Is Your Defense Truly Impervious? CoRR abs/2406.09026 (2024)
[i9]Yuxuan Zhang, Qing Zhang, Yiren Song, Jiaming Liu:
Stable-Hair: Real-World Hair Transfer via Diffusion Model. CoRR abs/2407.14078 (2024)
[i8]Yepeng Liu, Yiren Song, Hai Ci, Yu Zhang, Haofan Wang, Mike Zheng Shou, Yuheng Bu
:
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise. CoRR abs/2410.05470 (2024)
[i7]Wenda Shi, Yiren Song, Dengming Zhang, Jiaming Liu, Xingxing Zou:
FonTS: Text Rendering with Typography and Style Controls. CoRR abs/2412.00136 (2024)
[i6]Yiren Song, Shengtao Lou, Xiaokang Liu, Hai Ci, Pei Yang, Jiaming Liu, Mike Zheng Shou:
Anti-Reference: Universal and Immediate Defense Against Reference-Based Generation. CoRR abs/2412.05980 (2024)
[i5]Cong Wang, Xiangyang Luo, Zijian Cai, Yiren Song, Yunlong Zhao, Yifan Bai, Yuhang He, Yihong Gong:
GridShow: Omni Visual Generation. CoRR abs/2412.10718 (2024)
[i4]Yiren Song, Pei Yang, Hai Ci, Mike Zheng Shou:
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation. CoRR abs/2412.11638 (2024)
[i3]Yiren Song, Xiaokang Liu, Mike Zheng Shou:
DiffSim: Taming Diffusion Models for Evaluating Visual Similarity. CoRR abs/2412.14580 (2024)- 2023
[c3]Yiren Song, Xuning Shao, Kang Chen, Weidong Zhang, Zhongliang Jing, Minzhe Li:
CLIPVG: Text-Guided Image Manipulation Using Differentiable Vector Graphics. AAAI 2023: 2312-2320
[i2]Yuxuan Zhang, Jiaming Liu, Yiren Song, Rui Wang, Hao Tang, Jinpeng Yu, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing:
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation. CoRR abs/2312.16272 (2023)- 2022
[j1]Minzhe Li, Zhongliang Jing, Hongyu Zhu, Yiren Song:
Multi-sensor measurement fusion based on minimum mixture error entropy with non-Gaussian measurement noise. Digit. Signal Process. 123: 103377 (2022)
[c2]Yiren Song, Yuxuan Zhang:
CLIPFont: Text Guided Vector WordArt Generation. BMVC 2022: 543
[c1]Yiren Song:
CLIPTexture: Text-Driven Texture Synthesis. ACM Multimedia 2022: 5468-5476
[i1]Yiren Song, Xuning Shao, Kang Chen, Weidong Zhang, Minzhe Li, Zhongliang Jing:
CLIPVG: Text-Guided Image Manipulation Using Differentiable Vector Graphics. CoRR abs/2212.02122 (2022)
Coauthor Index
aka: Mike Zheng Shou

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-01-28 02:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







