Portfolio

Real-Time Multimodal Digital Human

Built a full-stack, real-time multimodal digital human for dialogue and music. The system ingests text, audio, and video streams in real time, uses face recognition to personalize responses and manage attention, and synchronizes lip and facial movements with the generated speech. The music agent infers the user's emotional state from voice and image, logs these states for later experiments, and queries a backend database to select tracks that match the detected emotion and the user's request. Implemented the web frontend in JavaScript; the system is now in production at a provincial court.
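The music agent's selection step can be illustrated with a minimal TypeScript sketch. The endpoint paths (`/api/emotion-log`, `/api/tracks`), field names, and function names below are assumptions for illustration only, not the production interface: the agent logs the inferred emotional state, then queries the backend for tracks matching the emotion label and the user's request.

```typescript
// Sketch only: hypothetical endpoints and field names, not the deployed API.

interface EmotionState {
  label: string;        // e.g. "calm", "sad", "excited"
  confidence: number;   // 0..1, from the voice/image emotion models
  timestamp: number;    // Unix epoch milliseconds
}

interface Track {
  id: string;
  title: string;
  mood: string;
}

// Log the inferred emotional state so later experiments can replay sessions.
async function logEmotionState(state: EmotionState): Promise<void> {
  await fetch("/api/emotion-log", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(state),
  });
}

// Query the backend for tracks whose mood matches the current emotion
// and the user's spoken or typed request.
async function selectTracks(state: EmotionState, request: string): Promise<Track[]> {
  const params = new URLSearchParams({ mood: state.label, query: request });
  const response = await fetch(`/api/tracks?${params.toString()}`);
  if (!response.ok) {
    throw new Error(`Track query failed: ${response.status}`);
  }
  return (await response.json()) as Track[];
}

// Example flow: record the state, then fetch matching tracks.
async function handleMusicRequest(state: EmotionState, request: string): Promise<Track[]> {
  await logEmotionState(state);
  return selectTracks(state, request);
}
```

Keeping the emotion log as a separate write, rather than folding it into the track query, is what makes the logged states reusable for offline experiments.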