Product portfolio
A selection of products from my time on Capital One's AI/ML platform team and the data science group at Alio, from exploratory research and zero-to-one features to scaled infrastructure serving 100M+ customers.
Led identity verification for Capital One's Online Account Opening flow, replacing the multi-day Government ID process with a real-time tap-to-verify experience. Applicants confirm their identity via a link on their phones, eliminating friction for the 85k+ applicants per year previously routed to manual review and driving incremental account bookings at scale.
Data scientists were previously tacking version suffixes onto feature names, flooding the platform with near duplicates and making reuse risky. Introduced first standardized versioning policy for data products, with automated change classification, tiered review flows, and a full audit trail to augment data product lifecycle governance.
Built Feature Lineage to document the full lifecycle of data from origination through to model use, tracking both dataset-level flow and element-level field transformations across pipelines. Gave data scientists instant visibility into where features came from and how they were computed, enabling faster knowledge transfer, safer feature reuse, and audit-ready documentation for compliance requests.
Created event-based triggers for feature compute to power Capital One's Feed recommender model, powering personalized tile ranking for millions of customers. Enabled personalization features capturing customer engagement signals (accept, like, dislike, dismiss, postpone), and interaction counts, allowing the model to surface the right offers and actions to each customer at the right moment.
Built a machine learning model to predict blood hematocrit non-invasively from raw PPG sensor data at Alio. The company's wearable SmartPatch uses infrared light to continuously monitor dialysis patients at home. Replaced a heuristic ratio-of-ratios baseline with a random forest regressor trained on AC and DC amplitude features across IR channels — reducing prediction error by 80% and establishing a full pipeline from MongoDB sensor reads through AWS SageMaker.