cpuimage/cpuimage

Hey 👋🏽, I'm cpuimage

AI engineer working on AIGC, inference optimization, and audio/video/image algorithms.
I build real-world AI systems, accelerate models, and share open-source work here on GitHub.

If my projects help you, feel free to buy me a coffee. ☕️


⚡ What I Do

  • AIGC engineering (Stable Diffusion, FLUX, SDXL, high‑res synthesis)
  • Inference optimization (TensorRT, FP16, Flash Attention, async pipelines)
  • Audio/video/image algorithms (TTS, matting, OpenGL effects)
  • Training stability & numerical optimization
  • CTO experience at multiple AI companies

🧠 Professional Experience

  • 👨🏽‍💻 Worked at leading tech companies, including Baidu, KingSoft, and others.
  • 🧩 Served as CTO at multiple AI companies (AIGC, image generation, inference optimization).
  • 📱 Developed algorithms for multiple applications.
  • 💡 Delivered AI‑based technical customization services and shipped several production‑level AI projects.

🚀 Research Progress & Achievements

I work across Stable Diffusion, inference acceleration, training stability, and audio/video algorithms.

🔥 Deep Learning (Selected Works)

  • A Trimap‑Free Solution for Real‑Time Automatic Portrait Matting on Mobile Devices
  • A Robust Optimizer With Accelerated Convergence Capability in Deep Learning
  • A General and Adaptive Robust Loss Structure Scheme
  • A Robust Loss Weighting Solution For Learning Long‑Tail Data
  • Image Synthesis and Semantic Manipulation Using Stable Diffusion Networks
  • Stable Diffusion Architecture Optimization And Deployment On Mobile Devices
  • A Robust Solution For Accelerated Training Convergence And Learning Long‑Tail Data
  • Arbitrary Resolution Super‑Resolution Solution for Real‑World Images
  • Accelerate Stable Diffusion FP16 Inference Deployment with TensorRT
  • Port Stable Diffusion X4 Upscaler to TensorFlow (FP16 supported)
  • Port Stable Diffusion PromptGen (GPT‑2) to TensorFlow + ONNX Inference
  • Stable Diffusion Architectural Distillation
  • Content‑aware 3‑view Synthesis for Game Art
  • Super‑Resolution Solution based on Stable Diffusion
  • Video Editing Techniques based on Stable Diffusion
  • Port Stable Diffusion XL 1.0 to TensorFlow (FP16 supported)
  • A Plug‑And‑Play Algorithm for Asynchronous Inference with Frequency‑Domain Reconstruction
  • Stable Diffusion Inference with PyTorch Weights and WebUI‑like Features in Keras 3.x
  • FLUX.1 FP16 Inference Deployment + Low‑Memory LoRA Training
  • LLM from Scratch with PyTorch
  • Enhanced FaceFusion: Decoupled Modules & Optimized Inference
  • Ultra High‑Resolution Portrait Retouching
  • Training‑Free Universal High‑Resolution Synthesis for Any Video Model
  • Chunked Flash Attention in Keras
  • Robustness and Speed: An Adaptive, Efficient Optimizer for Stable Training
    • Learning‑Rate‑Free
    • Warmup‑Free
    • Normalization‑Free
    • Corrected Gradient Accumulation
    • Long‑Tailed Gradient Mitigation
    • Accelerated Convergence
    • Memory‑Efficient
  • Loss Regularization for Better Generalization
  • Dynamic Loss Weighting for Multi‑Task Learning
  • Parameter‑Free Weight Regularization
  • Adaptive Moving‑Average BatchNorm Stabilization
  • Memory‑Efficient LLM Training
  • Numerical Stability via Scalable Parallel Compensated Reductions
  • MozzyTokenizer: Adaptive Byte‑Level Tokenizer

📊 Statistical Algorithms

  • Real‑time MMSE‑STSA speech enhancement (embedded implementation)

🤝 Collaboration & Contact

I’m open to collaboration on AIGC, inference optimization, and audio/image algorithms.

Reach me on:

  • Telegram
  • WeChat
  • QQ

For paid technical services or consulting:

  • Email
