Senior Undergraduate at Xidian University · Electronic Information Engineering
🏁 Actively seeking Ph.D./Research Assistant positions in World Model & Embodied AI for Fall 2026 or later!
I'm a senior undergraduate at Xidian University, majoring in Electronic Information Engineering. My research journey started with computer vision and multimodal perception — I worked with Dr. Hao Li on adversarial robustness through reinforcement learning, with Prof. Rui Yang on ML-driven optimization for RF/microwave systems, and with Prof. Jie Li on RGB-Thermal multimodal object tracking, where I explored how to adaptively fuse information from different sensory modalities under varying conditions.
Currently, I'm interning at Li Auto, working on code-agent architectures and LLM post-training, which has given me a deeper understanding of how large models learn to reason, plan, and interact with environments. Looking ahead, I'm especially excited about World Models and Embodied AI — building systems that can not only perceive and reason about the physical world, but also take actions within it. I see a natural progression from multimodal perception to agentic reasoning to physical intelligence, and that's the trajectory I want to pursue in my Ph.D. research.
I scored IELTS 8.0 (Listening 9.0, Speaking 7.0), and I'm comfortable working and communicating in English-speaking research environments.
Huge THANKS to Alma and Claude for helping me build this website and making differences to the world!
Currently organizing and uploading the code and related files step by step(^-^)
Applied contrastive learning with an improved ResNet-18 architecture for multispectral satellite imagery classification. Implemented adaptive image enhancement and multi-scale feature fusion, achieving a +32% accuracy improvement over the baseline model.
Designed a residual-based generator-discriminator architecture with sub-pixel convolution for 4× image upscaling. Leveraged VGG-19 perceptual loss to preserve texture realism, producing visually sharper and more detailed super-resolved images.
Built a text-mining quantitative factor library from financial news, extracting sentiment and event-driven signals. Employed entropy weighting and correlation analysis to construct a robust backtesting framework for factor evaluation.
Implemented a Swin-Unet framework for MRI/CT medical image segmentation, achieving 94% accuracy. Applied CUDA acceleration to optimize inference, resulting in a 2.5× speedup in processing time.
Developed a GA-optimized SVM combined with K-means clustering for student financial risk detection. Used PCA to reduce 17 behavioral indicators, achieving 76% accuracy on a dataset of 4,000+ students.
Contributed to university-level media and promotional content creation, helping shape the public image and outreach of the department through articles, graphics, and event coverage.
Mentoring junior students in academic planning and course guidance, sharing insights on research opportunities, study strategies, and navigating the engineering curriculum.