Skip to content
View cpuimage's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report cpuimage

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
cpuimage/README.md

Hey 👋🏽, I'm cpuimage

AI engineer working on AIGC, inference optimization, and audio/video/image algorithms.
I build real-world AI systems, accelerate models, and share open-source work here on GitHub.

If my projects help you, feel free to buy me a coffee. ☕️


⚡ What I Do | 我在做什么

  • AIGC engineering (Stable Diffusion, FLUX, SDXL, high‑res synthesis)
  • Inference optimization (TensorRT, FP16, Flash Attention, async pipelines)
  • Audio/video/image algorithms (TTS, matting, OpenGL effects)
  • Training stability & numerical optimization
  • Multi‑time CTO experience in AI companies

🧠 Professional Experience | 专业背景

  • 👨🏽‍💻 Worked at leading tech companies including
    Baidu, KingSoft, and others.
  • 🧩 Multi‑time CTO for AI companies (AIGC, image generation, inference optimization).
  • 📱 Developed algorithms for multiple applications:
  • 💡 Delivered AI‑based technical customization services and shipped several production‑level AI projects.

🚀 Research Progress & Achievements | 研究进展与成果

I work across Stable Diffusion, inference acceleration, training stability, and audio/video algorithms.

🔥 Deep Learning (Selected Works)

  • A Trimap‑Free Solution for Real‑Time Automatic Portrait Matting on Mobile Devices
  • A Robust Optimizer With Accelerated Convergence Capability in Deep Learning
  • A General and Adaptive Robust Loss Structure Scheme
  • A Robust Loss Weighting Solution For Learning Long‑Tail Data
  • Image Synthesis and Semantic Manipulation Using Stable Diffusion Networks
  • Stable Diffusion Architecture Optimization And Deployment On Mobile Devices
  • A Robust Solution For Accelerated Training Convergence And Learning Long‑Tail Data
  • Arbitrary Resolution Super‑Resolution Solution for Real‑World Images
  • Accelerate Stable Diffusion FP16 Inference Deployment with TensorRT
  • Port Stable Diffusion X4 Upscaler to TensorFlow (FP16 supported)
  • Port Stable Diffusion PromptGen (GPT‑2) to TensorFlow + ONNX Inference
  • Stable Diffusion Architectural Distillation
  • Content‑aware 3‑view Synthesis for Game Art
  • Super‑Resolution Solution based on Stable Diffusion
  • Video Editing Techniques based on Stable Diffusion
  • Port Stable Diffusion XL 1.0 to TensorFlow (FP16 supported)
  • A Plug‑And‑Play Algorithm for Asynchronous Inference with Frequency‑Domain Reconstruction
  • Stable Diffusion Inference with PyTorch Weights and WebUI‑like Features in Keras 3.x
  • FLUX.1 FP16 Inference Deployment + Low‑Memory LoRA Training
  • LLM from Scratch with PyTorch
  • Enhanced FaceFusion: Decoupled Modules & Optimized Inference
  • Ultra High‑Resolution Portrait Retouching
  • Training‑Free Universal High‑Resolution Synthesis for Any Video Model
  • Chunked Flash Attention in Keras
  • Robustness and Speed: An Adaptive, Efficient Optimizer for Stable Training
    • Learning‑Rate‑Free
    • Warmup‑Free
    • Normalization‑Free
    • Corrected Gradient Accumulation
    • Long‑Tailed Gradient Mitigation
    • Accelerated Convergence
    • Memory‑Efficient
  • Loss Regularization for Better Generalization
  • Dynamic Loss Weighting for Multi‑Task Learning
  • Parameter‑Free Weight Regularization
  • Adaptive Moving‑Average BatchNorm Stabilization
  • Memory‑Efficient LLM Training
  • Numerical Stability via Scalable Parallel Compensated Reductions
  • MozzyTokenizer: Adaptive Byte‑Level Tokenizer

📊 Statistical Algorithms

  • Real‑time MMSE‑STSA speech enhancement (embedded implementation)

🤝 Collaboration & Contact | 合作与联系

I’m open to collaboration on AIGC, inference optimization, and audio/image algorithms.

Reach me on:

  • Telegram Badge
  • Wechat Badge
  • QQ Badge

For paid technical services or consulting:

  • mail Badge

Pinned Loading

  1. chunked-flash-attention-keras chunked-flash-attention-keras Public

    Implementation of Chunked Flash Attention in Keras

    Python 1

  2. CelebAHairMask-HQ CelebAHairMask-HQ Public

    A large-scale face dataset for hair segmentation, hair recognition, and GANs for hair generation and editing.

    87 7

  3. minSDXLTF minSDXLTF Public

    Stable Diffusion XL Inference With PyTorch Weights And More Features Like Stable Diffusion Web UI In Keras 3.x

    Python 7 1

  4. minSDTF minSDTF Public

    Stable Diffusion V1.5 Inference With PyTorch Weights And More Features Like Stable Diffusion Web UI In Keras 3.x

    Python 16 2

  5. resampler resampler Public

    A Simple and Efficient Audio Resampler Implementation in C

    C 157 69

  6. WebRTC_NS WebRTC_NS Public

    Noise Suppression Module Port From WebRTC

    C 344 161