- โ Generic Outputs - Most AI content generators produce one-size-fits-all content without understanding individual preferences
- โ Single Modality - Tools typically generate only text OR images OR videos
- โ Platform Isolation - No integration with real social platforms for authentic user understanding
๐ฏ Recommendation Engine, Multimodal Analyzer, Profile Generator, and Content Creator All-in-One! We're building a future where AI truly understands YOU. ๐ก Simply connect your social accounts. Personalized-AIGC autonomously analyzes your behavior, builds your preference profile, and generates tailor-made contentโvideos, posts, and more! ๐
|
Automatic Preference Learning Transform raw user behavior data from social platforms into comprehensive preference profiles through multimodal LLM analysis of videos, images, and text. |
AI Video Production Studio Generate personalized video content including crosstalk, talk shows, music videos, meme edits, and news summaries tailored to user interests. |
Publication-Ready Posts Create engaging social media posts with AI-generated copy, images, and platform-specific links for Bilibili, Xiaohongshu, Zhihu, Douyin, and Weibo. |
30+ SOTA Models Leverage self-supervised learning recommendation with LightGCN, SGL, BERT4Rec, and more for precise user preference prediction. |
The Challenges We Solve:
| Challenge | Traditional Approach | Our Solution |
|---|---|---|
| ๐ญ User Understanding | Manual input of preferences | Automatic behavior analysis from social platforms |
| ๐ง Content Ideation | Generic prompts | Profile-driven creative idea generation |
| ๐ฌ Video Production | Single clips, no consistency | Multi-agent studio with 7 video types |
| ๐ฑ Social Posts | Text-only generation | Rich multimedia with platform links |
| ๐ Real-time Adaptation | Static profiles | Dynamic profiling from live data |
| ๐ง Zero Manual Input | ๐ฌ Multi-Modal Output | ๐ Platform Native | ๐ค SOTA RecSys | โก Real-Time |
|---|---|---|---|---|
| Automatic Profiling | Video + Text + Image | Bilibili, Xiaohongshu, Hupu | 30+ SSL Models | Live Data Analysis |
| No need to describe yourselfโwe learn from your actual behavior across platforms | Generate complete video productions, social media posts with images, and cross-platform content | Native integration with major Chinese social platforms for authentic user data | Powered by SSLRec (WSDM'24) with LightGCN, BERT4Rec, SGL, and more | Continuously update profiles with fresh user interaction data |
Personalized-AIGC is a multi-agent content generation framework that transforms user behavioral data into personalized creative content. Our system seamlessly bridges recommendation systems with AIGC, allowing AI to truly understand and create for individual users.
|
๐ท๏ธ DATA COLLECTION LAYER ๐บ Bilibili Spider โข ๐ Xiaohongshu Spider โข ๐ Hupu Spider โข ๐ฌ Douban Spider |
||
|
๐ฎ RECOMMENDATION ENGINE (SSLRec) LightGCN โข SGL โข SimGCL โข BERT4Rec โข DuoRec โข 30+ Models |
||
|
๐ฅ VIDEO ANALYST Frame Extraction โข Scene Understanding โข Content Summarization |
๐ผ๏ธ IMAGE ANALYST Visual Recognition โข Style Analysis โข Object Detection |
|
|
๐ง PROFILE GENERATION Item Profiles โข User Profiles โข Preference Ranking โข Creative Ideas |
||
|
๐ฌ VIDEO AGENT Cross Talk โข Talk Show โข MAD/SVC โข Rhythm Edit โข News โข Commentary |
๐ฑ TEXT-IMAGE GENERATOR AI Copywriting โข Image Generation โข Platform Links โข HTML Posts |
|
|
โจ OUTPUT LAYER ๐ฌ Personalized Videos โข ๐ฑ Social Media Posts โข ๐ผ๏ธ Generated Images โข ๐ User Insights |
||
|
๐ญ Cross Talk
|
Built with cutting-edge tools:
๐ฃ๏ธ CosyVoice โ Voice Cloning & TTS
๐ Fish-Speech โ Multilingual Speech Synthesis
๐ ImageBind โ Multimodal Embedding
๐ง Demucs โ Audio Source Separation
๐ผ Whisper โ Speech Recognition
OS: Linux, Windows
Python: 3.10+
CUDA: 12.1+ (recommended for GPU acceleration)
git clone https://github.com/your-username/Personalized-AIGC.git
cd Personalized-AIGC
# Create virtual environment
python -m venv paigc_env
source paigc_env/bin/activate # Linux/Mac
# or
.\paigc_env\Scripts\activate # Windows
# Install dependencies
pip install -e .Create a .env file in the project root:
# Dataset: bilibili, redbook, hupu, douban
DATASET=bilibili
# LLM Configuration
CHAT_API_KEY=your_openai_api_key
CHAT_BASE_URL=https://api.openai.com/v1
CHAT_MODEL=gpt-4o
# Image Generation (Optional)
IMAGE_API_KEY=your_image_api_key
IMAGE_BASE_URL=https://generativelanguage.googleapis.com/v1beta
IMAGE_MODEL=gemini-2.0-flashRun the complete pipeline with a single command:
python main.pyThis will:
- ๐ฎ Run SSL recommendation model
- ๐ง Analyze user content (video, image, text)
- ๐ Generate user profiles
- โจ Create personalized content
from tools.bilibili_realtime import BilibiliRealTime
# Collect user favorites from Bilibili
bilibili = BilibiliRealTime()
bilibili.process_all_users(save_data=True, download_videos=True)from tools.sslrec import run_sslrec
# Run self-supervised recommendation
run_sslrec()from tools.analyze import analyze
# Analyze all users' content with concurrent processing
analyze(max_workers=5)from tools.product import product
# Generate personalized video and text-image content
product()