MD AMIR KHAN

Data Scientist | Data Engineer | Financial Engineer

A versatile data professional with expertise in quantitative finance, data science, and data engineering. I specialize in leveraging advanced analytics, machine learning, and robust data pipelines to solve complex business problems across financial markets, investment strategies, and enterprise data systems. With a strong foundation in statistical modeling, programming, and data architecture, I thrive on transforming raw data into actionable insights and scalable solutions.

MD Amir Khan - Financial Engineer
Stevens
NSU
SCB
UCB

About Me

I am a versatile data professional with a passion for transforming complex data into actionable insights and scalable solutions. My expertise spans quantitative finance, data science, and data engineering, enabling me to tackle diverse challenges across financial markets, enterprise systems, and analytical workflows. With a strong foundation in statistical modeling, programming, and data architecture, I excel at solving complex problems through innovative data-driven approaches.

In the realm of data science, I specialize in building predictive models, performing advanced statistical analysis, and developing machine learning solutions that drive business value. My experience includes end-to-end data science workflows from data collection and preprocessing to model deployment and monitoring. I have successfully implemented various ML algorithms including neural networks, ensemble methods, and time series forecasting models.

As a data engineer, I design and implement robust data pipelines, ETL processes, and data infrastructure that can handle large-scale, real-time data processing. I have extensive experience with cloud platforms, database optimization, and building scalable data architectures that support both analytical and operational workloads.

In quantitative finance, I have developed sophisticated trading algorithms, risk management systems, and portfolio optimization frameworks. My work combines traditional financial theory with modern data science techniques to create innovative solutions for investment management and risk assessment.

My strong analytical mindset and technical versatility enable me to work effectively across different domains, from financial markets to enterprise data systems. I am a collaborative team player who thrives in fast-paced environments, where I can leverage my diverse skill set to work closely with data scientists, engineers, traders, and business stakeholders.

I am constantly expanding my knowledge and staying current with the latest advancements in data science, machine learning, and data engineering. My core areas of expertise include:

  • Machine Learning & AI: Neural networks, ensemble methods, deep learning, NLP
  • Data Engineering: ETL pipelines, cloud platforms, database optimization, real-time processing
  • Data Science: Predictive modeling, statistical analysis, feature engineering, model deployment
  • Quantitative Finance: Portfolio optimization, risk modeling, algorithmic trading
  • Big Data Technologies: Spark, Hadoop, Kafka, distributed computing
  • Cloud & DevOps: Docker, CI/CD, infrastructure as code

3.90

Current GPA

5+

Years Experience

10+

Projects Completed

Education

Master's in Financial Engineering & Analytics

Stevens Institute of Technology

2024 - 2025

GPA: 3.90/4.00

Focus: Quantitative Finance, Algorithmic Trading, Risk Analytics, Portfolio Optimization

Stochastic Calculus for Financial Eng. Applied Probability & Statistics in Finance Advanced Financial Risk Analytics & Derivatives Machine Learning in Finance Pricing & Hedging Computational Methods in Finance Portfolio Theory & Applications Market Microstructure Algorithmic Trading Strategies Design Patterns & Derivative Pricing in C++ Optimization in Finance

Bachelor in Business Administration

North South University (NSU)

2018 - 2022

Major: Finance | Minor: Mathematics

Key Courses: Calculus, Linear Algebra, Differential Equations, Corporate Finance, Investment Theory, Financial Derivatives, Applied Statistics

Latest News

Professional Certification

FRM Part I Exam Preparation

Currently preparing for the Financial Risk Manager (FRM) Part I examination, focusing on quantitative analysis, financial markets and products, and risk management foundations.

Research Paper

Towards a Robust PCA and Dynamic Factor Portfolios Updating

Working on a research paper with Professor Papa Momar NDIAYE. In this paper, we propose a dynamic tracking algorithm that modifies the classical Principal Component Analysis to reduce the instability of risk levels and principal factors. The goal is twofold: to ensure to the principal factors some immunity against perturbations on observations and to stabilize the factors when updating covariance matrix.

Those factors are of paramount importance in a portfolio allocation setting where they form a basis of orthogonal factor portfolios that generates the efficient frontier. By ensuring a smooth transition between those factors portfolio under some spectral conditions and detecting changes in risk clusters that warrant a reset of the entire tracking process, we provide tools to improve the timing of portfolio rebalancing and reduce transaction costs.

Experience

Data Science Research Assistant

Stevens Institute of Technology

June 2024 - Present ยท Part-time
  • Led end-to-end data science projects combining quantitative finance with advanced analytics, focusing on machine learning model development, data pipeline engineering, and automated trading systems
  • Architected and implemented scalable data engineering solutions for processing large-scale financial datasets, including real-time data ingestion from Polygon API and Nasdaq auction data using Python, Apache Spark, and cloud platforms
  • Developed and deployed production-ready machine learning models (neural networks, LightGBM, ensemble methods) for predictive analytics, achieving high accuracy in short-term price movement prediction and options pricing
  • Built comprehensive backtesting frameworks and automated performance monitoring systems with real-time dashboards, enabling data-driven decision making and strategy optimization
  • Designed robust data pipelines and ETL processes for high-frequency trading data, implementing automated feature engineering, model validation, and risk management systems
  • Pioneered research in AI-powered financial applications, exploring agentic AI and financial LLM integration for next-generation trading frameworks and automated decision systems
Python Machine Learning Data Engineering Apache Spark Neural Networks Cloud Platforms ETL Pipelines Financial LLMs

Quantitative Data researcher

United Commercial Bank

January 2022 - December 2023
  • Engineered automated SQL/Python pipelines for 1,400+ funds, streamlining data processing and reporting workflows
  • Built Tableau/Power BI dashboards for $200M+ portfolios, enabling managers to monitor volatility and downside risk
  • Designed predictive allocation models, boosting Sharpe ratio by 34% through EWMA and shrinkage covariance methods.
  • Applied statistical attribution methods to 14 quarters of fund data, uncovering performance drivers and enhancing compliance
  • Delivered business-facing insights by translating complex data outputs into clear, actionable recommendations for executives
Python SQL Tableau Power BI EWMA Risk Analytics

Quant Investment & Data Analyst

Standard Chartered Bank

January 2021 - December 2021
  • Built scalable Python/SQL pipelines to process 50K+ daily transactions, improving data integrity and anomaly detection
  • Analyzed and backtested equity portfolios using pandas, matplotlib & Plotly, delivering performance dashboards (return, drawdown, risk metrics) to portfolio managers to evaluate strategy effectiveness.
  • Automated daily risk and performance reporting for equity strategies with $200M+ AUM, ensuring accuracy and timeliness
  • Designed Tableau dashboards to visualize key portfolio metrics, enabling senior management to track risk and performance
  • Collaborated with portfolio and risk managers on $100M+ portfolios, translating analytics into actionable insights improving investment decisions
Python SQL Pandas Matplotlib Plotly Tableau

Projects

Bond Portfolio Optimization and Immunization

August 2025

Comprehensive bond portfolio management system combining quantitative finance with data engineering. Implements duration matching, convexity adjustments, and immunization strategies using real-time data pipelines, automated risk calculations, and scalable portfolio optimization algorithms for fixed income portfolios.

Python Fixed Income Duration Matching Immunization Interest Rate Risk

Vasicek Bond Pricing Model - Monte Carlo, PDE & Analytical

July 2025

Comprehensive implementation of the Vasicek interest rate model featuring three pricing approaches: analytical solutions, Monte Carlo simulations, and PDE finite difference methods for zero-coupon bonds.

Jupyter Notebook Vasicek Model Monte Carlo PDE Analytical Solutions

Portfolio Optimization

July 2025

Strategic asset allocation framework using modern portfolio theory, risk parity, and advanced optimization techniques with Riskfolio-Lib for multi-asset portfolio construction.

Jupyter Notebook Portfolio Theory Riskfolio-Lib Risk Parity Asset Allocation

Vasicek Bond Pricing and Kalman Filtering

June 2025

Multi-method fixed income modeling combining Vasicek interest rate dynamics with Kalman filtering for parameter estimation and state variable tracking in bond pricing applications.

Jupyter Notebook Kalman Filter Fixed Income Parameter Estimation State Space Models

Data Science Projects

June 2025

Collection of data science applications in finance including statistical analysis, machine learning models, and data visualization for financial time series and market data.

Jupyter Notebook Data Science Statistical Analysis Machine Learning Financial Data

Trading Strategy Based on MACD Signals

June 2025

Technical analysis-driven trading strategy using MACD (Moving Average Convergence Divergence) indicators for signal generation, backtesting, and performance evaluation.

Jupyter Notebook Technical Analysis MACD Trading Signals Backtesting

Cryptocurrency Forecasting Using ARIMA

June 2025

Time series forecasting application for cryptocurrency price prediction using ARIMA models, stationarity testing, and model selection for optimal forecasting accuracy.

Jupyter Notebook ARIMA Time Series Cryptocurrency Forecasting

Stock Price Prediction and Trading Strategy Using LSTM

March 2025

Deep learning approach to stock price prediction using LSTM neural networks, combined with algorithmic trading strategy development and performance backtesting.

Jupyter Notebook LSTM Deep Learning Stock Prediction Neural Networks

Stock Brokerage System Low Level Design

February 2025

High-performance stock brokerage system architecture featuring order matching engine, portfolio management, and real-time market data processing.

C++ System Design Order Matching Low Latency Trading Systems

Option Pricing Models

February 2025

Comprehensive options pricing library implementing Black-Scholes, binomial trees, and Monte Carlo methods for European and American options valuation with Greeks calculation.

Jupyter Notebook Options Pricing Black-Scholes Binomial Trees Greeks

SPY Momentum Alpha Backtesting

February 2025

High-frequency momentum trading strategy combining data engineering and quantitative finance. Built robust data pipelines processing 2 years of SPY tick data from Polygon API, implemented real-time signal generation, and achieved 79% total return with comprehensive performance analytics and automated backtesting frameworks.

Jupyter Notebook Momentum Trading Polygon API High Frequency Backtesting

Pairs Trading Strategy

February 2025

Statistical arbitrage strategy using cointegration analysis and mean reversion. Employed Euclidean distance method for pair selection with z-score based entry/exit signals.

Jupyter Notebook Pairs Trading Cointegration Statistical Arbitrage Mean Reversion

Options Pricing Using Machine Learning

September 2024

Advanced machine learning approach to options pricing combining deep learning with financial engineering. Implemented neural networks, random forests, and ensemble methods with automated feature engineering, model validation pipelines, and real-time pricing systems that outperformed traditional Black-Scholes pricing in complex market conditions.

Jupyter Notebook Machine Learning Neural Networks Options Pricing Ensemble Methods

Market Analytics Web Application

August 2024

Full-stack data science application for comprehensive market analysis. Built interactive web application with real-time data ingestion pipelines, advanced data visualization dashboards, automated technical indicator calculations, and machine learning-powered trading signal generation with scalable cloud deployment.

Jupyter Notebook Web Application Market Analysis Data Visualization Technical Indicators

Activities & Awards

Student Membership

CFA Society New York

Student member, actively engaged in professional events

Open Source Contribution

Riskfolio-Lib

Contributing to a leading Python library for portfolio optimization and risk management

Competition Participation

WorldQuant's 2023 International Quant Championship

Competed in crafting & testing advanced trading strategies

Certifications & Licenses

udemy

Complete Algorithmic Trading Course with Python, ChatGPT, ML

Udemy

July 2025

Comprehensive algorithmic trading course covering Python programming, machine learning integration, and ChatGPT applications for automated trading strategies.

Algorithmic Trading Python Machine Learning ChatGPT

Akuna Capital Options 101

Akuna Capital

July 2025

Professional options trading course from leading market maker covering payoff diagrams, volatility, Greeks, and market-making fundamentals.

Options Trading Greeks Volatility Market Making
udemy

Complete Data Science, Machine Learning, DL NLP Bootcamp

Udemy

July 2025

Comprehensive bootcamp covering data science fundamentals, machine learning algorithms, deep learning, and natural language processing applications.

Data Science Machine Learning Deep Learning NLP
udemy

Taking Python to Production: Professional Onboarding Guide

Udemy

July 2025

Advanced Python course focusing on production deployment, best practices, and professional development workflows for enterprise applications.

Python Production DevOps Enterprise
udemy

The Ultimate JSON With Python Course + JSONSchema & JSONPath

Udemy

July 2025

Comprehensive JSON handling in Python including schema validation, path queries, and advanced data manipulation techniques.

JSON Python JSONSchema Data Processing
udemy

Master Time Series Analysis and Forecasting with Python 2025

Udemy

June 2025

Advanced time series analysis covering ARIMA, SARIMA, Prophet, LSTM, and modern forecasting techniques for financial and business applications.

Time Series ARIMA Prophet LSTM
udemy

Manage Finance Data with Python & Pandas: Unique Masterclass

Udemy

July 2025

Specialized course on financial data management and analysis using Python and Pandas for quantitative finance applications.

Pandas Financial Data Data Management Quantitative Analysis
udemy

Master Regression & Prediction with Pandas and Python [2025]

Udemy

July 2025

Advanced regression analysis and prediction modeling using Python and Pandas for financial and statistical applications.

Regression Analysis Prediction Models Pandas Statistical Analysis
udemy

Mathematics-Basics to Advanced for Data Science And GenAI

Udemy

July 2025

Comprehensive mathematics foundation covering linear algebra, calculus, probability, and statistics for data science and AI applications.

Linear Algebra Calculus Probability Statistics
udemy

Python Object Oriented Programming (OOP): Beginner to Pro

Udemy

July 2025

Advanced Python OOP concepts including inheritance, polymorphism, design patterns, and enterprise-level programming practices.

Python OOP Design Patterns Inheritance Polymorphism
udemy

The Complete SQL Bootcamp (30 Hours): Go from Zero to Hero

Udemy

July 2025

Comprehensive SQL training covering database design, complex queries, optimization, and real-world database management scenarios.

SQL Database Design Query Optimization Data Management
udemy

Fixed Income Analytics: Pricing and Risk Management

Udemy

July 2025

Specialized fixed income course covering bond pricing, yield curve analysis, duration, convexity, and interest rate risk management.

Fixed Income Bond Pricing Yield Curves Risk Management
udemy

Learn Python Requests

Udemy

July 2025

Specialized Python course focusing on HTTP requests, API integration, and web scraping for financial data collection.

Python Requests API Integration Web Scraping Data Collection
udemy

The Ultimate Pandas Bootcamp: Advanced Python Data Analysis

Udemy

July 2025

Advanced Pandas mastery for complex data manipulation, analysis, and visualization in financial and business contexts.

Pandas Data Analysis Data Manipulation Financial Analysis
udemy

FastAPI - The Complete Course 2025 (Beginner + Advanced)

Udemy

July 2025

Modern Python web framework for building high-performance APIs, essential for financial data services and algorithmic trading platforms.

FastAPI REST APIs Web Development Python
udemy

Mathematical Foundations of Machine Learning

Udemy

2025

Deep mathematical foundations covering linear algebra, partial derivatives, calculus, and probability theory for advanced machine learning applications.

Linear Algebra Calculus Probability Machine Learning
udemy

Python Data Analysis: NumPy & Pandas Masterclass

Udemy

2025

Advanced data analysis techniques using NumPy and Pandas for quantitative finance and statistical computing applications.

NumPy Pandas Data Analysis Statistical Computing
edX

GSX Verified Certificate for Probability - The Science of Uncertainty and Data

MIT / edX

December 2022

Rigorous probability theory course covering uncertainty quantification, statistical inference, and data analysis fundamentals from MIT.

Probability Theory Statistical Inference Uncertainty Data Analysis
Coursera

Python and Statistics for Financial Analysis

Coursera

February 2022

Specialized course combining Python programming with statistical methods for financial data analysis and investment decision making.

Python Financial Statistics Investment Analysis Portfolio Management

Technical Skills

Programming Languages

Python
SQL

Data Science & ML Libraries

NumPy Pandas SciPy Scikit-learn PyTorch TensorFlow Keras Matplotlib Seaborn Plotly OpenBB Statsmodels Zipline PyFolio Riskfolio-Lib Vectorbt

Data Engineering & Big Data

Apache Spark Apache Kafka Apache Airflow Hadoop Elasticsearch Redis MongoDB PostgreSQL MySQL Apache Beam dbt Great Expectations Databricks PySpark Delta Lake SQL Warehouses

Cloud & DevOps

Google Cloud Azure Azure Data Factory Azure Synapse Analytics Azure Data Lake Storage (ADLS) Docker Kubernetes Jenkins GitHub Actions Terraform Ansible CI/CD Infrastructure as Code

Development & Tools

Jupyter Notebook Git GitHub VS Code FastAPI Streamlit Tableau Power BI Excel MLflow Kubeflow Apache Superset

Data Science & Analytics

Machine Learning Deep Learning Statistical Modeling Predictive Analytics Feature Engineering Model Deployment A/B Testing Data Visualization Time Series Analysis Natural Language Processing Computer Vision Recommendation Systems

Quantitative Finance

Portfolio Optimization Risk Management Algorithmic Trading Derivatives Pricing Backtesting VaR & CVaR Factor Modeling Market Microstructure Fixed Income Analytics Options Pricing Stress Testing Performance Attribution

Resume

MD Amir Khan - Resume

Data Scientist & Financial Engineer

Updated: December 2024

Format: PDF

Get In Touch

Email

mkhan37@stevens.edu

Phone

+1 (201) 234-7017

Location

Hoboken, New Jersey, USA

LinkedIn

linkedin.com/in/amirkhan2317