Machine Learning

🚀 I recently wrote a overview article about **Model Merging**

✍️ I recently wrote a overview article about **Model Merging**, covering nearly 30 methods with detailed mathematical derivations.

Sep 1, 2025

Model Merging Overview

Please refer to this ariticle for details.

Aug 22, 2025

🚀 I have sorted out several projects I have done in recent months

✍️ In recent months, I have done some interesting projects, including ViT on CIFAR-10, Simple ML, ASR, etc.

Jul 1, 2025

🎙️ Auto Speech Recognition

I dove into the world of Automatic Speech Recognition (ASR) by building a Large Vocabulary Continuous Speech Recognition (LVCSR) system using the Kaldi toolkit.

Jun 25, 2025

Simple-ML

TripDataset Machine Learning Project This project is a complete implementation of machine learning pipelines applied to the TripDataset, focusing on data preprocessing, classification, and regression tasks, including: 🧹 Data preprocessing and cleaning (handling missing values, outlier detection, normalization, and feature engineering) 🤖 Model training for classification and regression (various ML algorithms for categorical and continuous prediction tasks) 📊 Performance evaluation and metrics (accuracy, F1-score, RMSE, and other evaluation techniques) 🔍 Exploratory data analysis and visualization (insightful plots for feature relationships, distribution, and model performance)

Jun 7, 2025

🧩 ViT on CIFAR-10

This project is a complete implementation of Vision Transformer (ViT) applied to small-scale datasets (especially CIFAR-10), including extensive exploration.

May 31, 2025

ViT on CIFAR-10

ViT-torch: Vision Transformer on CIFAR-10 (PyTorch) This project is a complete implementation of Vision Transformer (ViT) applied to small-scale datasets (especially CIFAR-10), including: 🎯 Model implementations with various configurations (native ViT, ResNet+ViT hybrid, different patch/heads/blocks setups, Stochastic Depth/DropPath, etc.) 🌹 Training and evaluation scripts (with learning rate schedulers: Warmup/Linear/Cosine/Constant-Cosine/Warmup-Constant-Cosine) 🧩 Data augmentation (RandomCrop+Paste, MixUp, CutMix, RandAugment, and batch random augmentation) 📈 Visualization and analysis (attention maps, attention distance, gradient rollout, feature maps, positional embedding similarity)

May 31, 2025

Weight Spectra Induced Efficient Model Adaptation

PEFT from fully fine-tuning perspective

May 29, 2025

🎙️ Voice Activity Detection

I conducted extensive experiments comparing frame division methods and model performances, with rich visualizations.

May 15, 2025

🎉 I have opensourced my VAD project recently

📈 I conducted extensive experiments comparing frame division methods and model performances, with rich visualizations.

May 5, 2025