
Captis
Scope:
Back EndFront EndDevelopment of a data-driven medical research documentation platform, enhancing data integrity and user experience.
At Celegence, I contributed to the design and development of a data-driven medical research documentation platform used by global regulatory teams to manage complex compliance and scientific documentation workflows. The platform supports the creation, comparison, and management of thousands of research documents while maintaining strict data integrity and auditability.
Project Goals & Context
The challenge was to modernize Celegenceβs existing research documentation system into a robust, scalable, and version-controlled platform capable of handling large data volumes and ensuring regulatory traceability. The solution needed to combine advanced content editing, high-speed search, and structured data ingestionβall within a clean and intuitive user interface.
Key Contributions & Technical Achievements
Content Versioning & Diff Tracking
- Designed and implemented a document versioning system using diff-based content storage, allowing users to track and compare changes across document revisions with high accuracy.
- Integrated ProseMirror / TipTap editor extensions, enabling domain-specific markup, citation linking, and inline validation for medical content.
- Ensured data integrity through robust version control logic and audit trail mechanisms.
High-Performance Search on Large Datasets
- Implemented a fast, full-text search engine on MongoDB collections exceeding 30 million records, optimizing indexes and query performance for near-instant retrieval.
- Reduced average query response times from seconds to milliseconds through efficient caching and optimized schema design.
Large-Scale Data Ingestion & Processing
- Engineered a dedicated ingestion service for parsing and transforming large XML datasets (hundreds of MBs per file) from external data sources.
- Implemented validation, transformation, and deduplication pipelines to ensure data consistency and accuracy before import into the production database.
- Automated data import workflows with job queues and monitoring to ensure reliability under high load.
Frontend Development & User Experience
- Built modular React components integrated with MobX state management for high responsiveness and maintainability.
- Focused on usability, accessibility, and visual consistency across complex workflows involving nested data structures and multi-document operations.
Results & Impact
- Delivered a scalable, auditable documentation system that improved data accuracy and research traceability.
- Enabled regulatory teams to search, review, and update documents up to 80% faster than in the legacy system.
- Improved system reliability and reduced ingestion processing time for large XML datasets through efficient service design.