ResearchSept 2023 – Jan 2025

RAG-Enhanced Decision Modeling for Patent Office Actions

AI system helping improve patent office action submission and prior-art matching using domain-specific retrieval models.

Project Overview

Mentored by professors at Tsinghua University AI Laboratories. Led a three-member research team to design a retrieval-augmented generation (RAG) based system for automating patent office action drafting and prior-art matching. Collected, standardized, and preprocessed 100,000+ multilingual patent records from USPTO and CNIPA sources, building robust text-cleaning, tokenization, and entity-extraction pipelines to ensure consistency. Developed domain-specific embeddings and semantic similarity models using transformer-based architectures (e.g., BERT, SentenceTransformers) to link patent claims with supporting evidence, improving retrieval precision by 20% over baseline. Designed a multi-stage retrieval–ranking–re-ranking framework that incorporated statistical weighting and cosine similarity thresholds, enhancing both result interpretability and legal transparency for expert review.

Technologies Used

PythonLLMRAGBERTSentenceTransformersVector SearchData EngineeringInformation Retrieval

Impact

Improved retrieval precision by 20% and enhanced interpretability in legal AI workflows