cv
This is a description of the page. You can modify it in '_pages/cv.md'. You can also change or remove the top pdf download button.
Basics
| Name | Ishan Ranjan |
| Label | Data Scientist, AI Researcher, Software Engineer |
| iranjan@bu.edu | |
| Phone | (925)-719-7349 |
| Url | https://iranjan31.github.io |
| Summary | Data professional with experience in custom-tailored computational analysis solutions and providing in-depth analysis of relevant data. |
Work
-
2026.01 - Present Senior Software Engineer Engineer
Talon
Leading data integration efforts for Third-Party-Administrator clients, powering Talon's medical price transparency and health plan rewards platform.
-
2025.08 - 2025.10 Solutions Engineer
CodeRabbit
Spent a couple months sojourning as a Solutions Engineer at CodeRabbit. Worked with a hyper-growth GTM team and learned the ins-and-outs of a successful organization. Spearheaded field::engineering support for CodeRabbit integration with Azure DevOps, and helped grow ARR by a whole lot month-over-month.
- Partnered with 5 Account Executives to nurture customer relationships and provide technical product expertise, growing ARR by 15% month over month
- Provided technical support to pre-sales and post-sales customers by responding to product-related technical questions and architecting solutions on customer end.
-
2024.12 - 2025.08 Data Engineer
John Hancock
As a Data Engineer on Manulife | John Hancock’s Personal Investing platform team, I designed and maintained end-to-end data pipelines supporting Advanced Analytics for the Business Intelligence team, with a focus on 401k to IRA rollovers. My work integrated diverse data sources — including third-party broker data (Morningstar, SS&C, Fidelity), marketing and user engagement data (Marketo, Domo), and AWS-based sales call transcripts — into unified, pipeline-driven workflows. This enabled actionable insights for stakeholders and strengthened decision-making across the platform.
- Delivered actionable insights for the Personal Investing Business Intelligence team by leading development and maintenance of 50+ Python ETL tasks.
- Reduced technical debt and improved system scalability by migrating legacy pipelines to a micro services architecture using FastAPI, Azure Data Factory, Azure Kubernetes, and SQLAlchemy.
- Engineered and deployed a NLTK-based cosine-similarity vector search application on Databricks that scans sales-call transcripts against approved IRA-Rollover scripts, ensuring real-time protocol adherence and cutting compliance-review time.
-
2024.09 - 2024.12 Graduate Teaching Assistant
Boston University College of Arts and Sciences
Teaching Assistnat for CAS CS 440, Introduction to Artificial Intelligence under Dr. Andrew Wood. Taught two discussion sections a week, assisted students during office hours, graded programming and written assignments, and contributed to solution and autograder code.
-
2024.05 - 2024.12 Graduate Researcher
Faculty of Computing and Data Sciences, Boston University
Thesis research under Dr. Thomas Gardos. Researching LLM use cases for academic curriculum advising and academic tutoring.
-
2022.01 - 2023.07 Data Engineer
ROSALIND
As a Data Engineer working with the Data Science and Platform Engineering teams, I was responsible for the development of backend workflows. I worked on all levels of the tech stack, from frontend development to pipeline implementation. I also interfaced and consulted with key industry partners to help bring their technologies to the platform.
- Enhanced data processing by developing of backend pipelines using Python and R. Improved workflow efficiency and reliability via introduction of Apache Airflow automation.
- Delivered high-performance APIs and novel visualization tools for industry partners using Django, Angular, and Flask
- Administered and maintained Google Cloud infrastructure, Cloud SQL, and POSTGRES databases to ensure optimal availability, security, and performance.
- Improved productivity and agility by playing a key role in refining Test/QA methodologies and optimizing development team's overall scrum process in conjunction with stakeholders.
-
2021.03 - 2022.01 Research Associate, Informatics
Biosplice Therapeutics
As a Research Associate working as a part of Biosplice’s Informatics group, I implemented and functionalized analysis pipelines for bulk data analysis. I helped to administer and optimize our informatics resources as Biosplice’s Linux Server Admin. I also played a key role in Biosplice’s laboratory information storage and technology by administrating our Dotmatics LIMS system.
- Enhanced research infrastructure by implementing pipelines for bulk data analysis and identification of novel alternative splicing events using Python, R, and bash.
- Streamlined data intake, backup to AWS S3 Storage, and sample quality control by developing an efficient next-generation sequencing (NGS) data stewardship system.
- Revitalized Dotmatics LIMS, delivered scalability and reliability enhancements boosting system efficiency and easing user roadblocks and human error.
- Led transition of LIMS frontend and backend to AWS EC2, ensuring smoother, more scalable cloud-based operations and hosting. Guaranteed optimal performance and reliability of critical in-house Oracle databases on Linux servers and virtual machines through administration and maintenance.
Education
-
2023.09 - 2025.01 Boston, Massachusetts
-
2017.09 - 2021.06 San Diego, California
Skills
| Languages and Libraries | |
| Python | |
| R | |
| C++ | |
| Java | |
| Bash | |
| Pandas | |
| Numpy | |
| CUDA | |
| Git | |
| Django | |
| Jinjia | |
| Angular | |
| Flask | |
| Numpy | |
| SQL |
| Machine Learning | |
| TensorFlow/Keras | |
| PyTorch | |
| Sklearn | |
| LLMs | |
| Langchain | |
| RAG Methods | |
| PEFT | |
| CNNs | |
| Transformers | |
| HuggingFace | |
| GANs | |
| Variational Autoencoders |
| Frameworks and Methods | |
| Algorithm Design | |
| Database Administration | |
| Linux/Unix | |
| MapReduce | |
| Kafka | |
| Docker | |
| Kubernetes | |
| AWS | |
| AWS Sagemaker | |
| Google Cloud Platform | |
| Oracle Cloud | |
| Oracle Databases | |
| Airflow | |
| BEAM | |
| MongoDB | |
| MySQL | |
| PostgreSQL |
Publications
-
2021.06.10 Identification of Lung and Blood Microbiota Implicated in COVID-19 Prognosis
Cells
Characterization of lung and blood microbiome and their implication on COVID-19 prognosis through analysis of peripheral blood mononuclear cell (PBMC) samples, lung biopsy samples, and bronchoalveolar lavage fluid (BALF) samples
Volunteer
-
2021.09 - Present Assistant Director of Operations
Association of South-Asian A Cappella
Assistant Director of Operations for the Association of South Asian A Cappella (desiacappella.org), a 501(c)(3) nonprofit organization.
- Led operations team of six people in building out scoring, audition, and judging processes and applications
- Implemented digital scoring processes for collegiate South Asian A Cappella competitions using Google Apps Script and Python.
- Led Development on a custom-tailored ELO-based ranking system for competitive teams to determine national championship qualification.
- Analyzed Scoring Distributions and Judge Feedback Sentiment using Seaborn, Numpy, Pandas, NLTK, spaCy
Languages
| English | |
| Native speaker |
| Hindi | |
| Fluent |
| Spanish | |
| Fluent |