Hi, I'm Matthew
Genome Analyst | Bioinformatician | Web Developer
Learn MoreI am a Genome Analyst and Bioinformatics Professional with over 7 years of experience in genomic data analysis, variant interpretation, and healthcare analytics. My career began in the wet lab, where I mastered molecular biology techniques and DNA sequencing at institutions like the Clinical Genomics Centre and The Centre for Applied Genomics. This hands-on experience provided a strong foundation in genomic research and clinical diagnostics.
Building on this foundation, I transitioned into variant interpretation at the Laboratory Medicine Programs at UHN and Hamilton Health Sciences, analyzing complex genetic data to support clinical decision-making. To address the increasing complexity of genomic datasets, I expanded my expertise into bioinformatics and programming. Proficient in Python, Flask, and MongoDB, I have developed custom pipelines and web-based tools to streamline workflows, enhance data visualization, and improve data accessibility.
Today, I bring a unique blend of wet lab expertise, variant interpretation proficiency, and technical skills to deliver innovative, data-driven solutions in genomics. My passion lies in leveraging technology to optimize genomic workflows and advance healthcare outcomes.
A Python script that scrapes Nike's website for upcoming sneaker releases, sends email alerts for new products, and calculates urgency based on release time.
Technologies: Python, BeautifulSoup, Pandas, SMTP
This project leverages Reddit's API to scrape and preprocess subreddit data, applying sentiment analysis using TF-IDF and word2vec models to develop a generalized framework for real-time trend insights and community sentiment monitoring.
Technologies: Python, SciKitLearn, nltk, MongoDB
A web-based application for analyzing genomic variants.
Technologies: Python, Flask, MongoDB, Bootstrap, Celery
A framework leveraging REVEL and BayesDel scores to evaluate genetic variant pathogenicity. The pipeline automates data retrieval, processing, and scoring for genomic variants.
Technologies: Python, Pandas, Google Colab, REVEL, BayesDel
A Python-based project analyzing security incidents across Toronto's transit network by integrating TTC open data and real-time Twitter feeds. The analysis identifies trends and patterns in safety-related events for buses, subways, and streetcars.
Technologies: Python, Pandas, Regex, Google Colab, Flourish, Twitter API
A Flask-based web application developed in collaboration with Hamilton Health Sciences, designed to aggregate and organize patient data with alpha or beta globin mutations. This database enables advanced searching and filtering at the sample level, facilitating precise data analysis and clinical decision-making. Inspired by Ithanet but tailored for institutional needs, it aims to improve operational efficiency and patient care.
Technologies: Flask, Python, MongoDB, Pandas, HTML, CSS, JavaScript
A Python script that adjusts subtitle timings in SRT files, allowing precise synchronization for video playback. The tool is customizable with adjustable offsets and ensures seamless formatting.
Technologies: Python, Datetime, File I/O
A highly customized Python script developed for Hamilton Health Sciences (HHSC) to generate patient-specific genomic reports by integrating data from SoftGenetics Geneticist Assistant. The tool automates file monitoring, data processing, and report creation tailored to HHSC's needs.
Technologies: Python, Pandas, Python-docx, BeautifulSoup
A Python-based tool designed to generate copy number variation (CNV) heatmaps from NextGene CNV tool output. This script allows users to filter genes of interest, analyze CNV gains and losses, and visualize the data in an intuitive heatmap format.
Technologies: Python, Pandas, Matplotlib, Seaborn, NumPy