โ๏ธ I recently wrote a overview article about **Model Merging**, covering nearly 30 methods with detailed mathematical derivations.
Sep 1, 2025

Please refer to this ariticle for details.
Aug 22, 2025

โ๏ธ In recent months, I have done some interesting projects, including ViT on CIFAR-10, Simple ML, ASR, etc.
Jul 1, 2025

I dove into the world of Automatic Speech Recognition (ASR) by building a Large Vocabulary Continuous Speech Recognition (LVCSR) system using the Kaldi toolkit.
Jun 25, 2025

TripDataset Machine Learning Project This project is a complete implementation of machine learning pipelines applied to the TripDataset, focusing on data preprocessing, classification, and regression tasks, including: ๐งน Data preprocessing and cleaning (handling missing values, outlier detection, normalization, and feature engineering) ๐ค Model training for classification and regression (various ML algorithms for categorical and continuous prediction tasks) ๐ Performance evaluation and metrics (accuracy, F1-score, RMSE, and other evaluation techniques) ๐ Exploratory data analysis and visualization (insightful plots for feature relationships, distribution, and model performance)
Jun 7, 2025

This project is a complete implementation of Vision Transformer (ViT) applied to small-scale datasets (especially CIFAR-10), including extensive exploration.
May 31, 2025

ViT-torch: Vision Transformer on CIFAR-10 (PyTorch) This project is a complete implementation of Vision Transformer (ViT) applied to small-scale datasets (especially CIFAR-10), including: ๐ฏ Model implementations with various configurations (native ViT, ResNet+ViT hybrid, different patch/heads/blocks setups, Stochastic Depth/DropPath, etc.) ๐น Training and evaluation scripts (with learning rate schedulers: Warmup/Linear/Cosine/Constant-Cosine/Warmup-Constant-Cosine) ๐งฉ Data augmentation (RandomCrop+Paste, MixUp, CutMix, RandAugment, and batch random augmentation) ๐ Visualization and analysis (attention maps, attention distance, gradient rollout, feature maps, positional embedding similarity)
May 31, 2025

PEFT from fully fine-tuning perspective
May 29, 2025

I conducted extensive experiments comparing frame division methods and model performances, with rich visualizations.
May 15, 2025

๐ I conducted extensive experiments comparing frame division methods and model performances, with rich visualizations.
May 5, 2025