
OPEN TO OPPORTUNITIES

HI, I'M
ANUJ PATIL

GIS & DATA SCIENTIST

"Turning Location Data into Actionable Intelligence"

I transform complex geospatial data into clear, impactful solutions. From building cost-efficient cloud pipelines to developing ML models and interactive dashboards, I help organizations make smarter, data-driven decisions. Currently exploring new challenges across GIS, data engineering, analytics, and machine learning.

FEATURED PROJECTS

Delivered to 3 counties (2024)

A2A Corridor Roadkill Analysis

Identified wildlife–vehicle collision hotspots using kernel-density and penalized regression. Analyzed 1,000+ collision records with traffic, land cover, and environmental data. Impact: Planner-ready mitigation layers delivered to 3 counties.

ArcGIS Pro, GeoPandas, scikit-learn, +2 more

Deployed for St. Lawrence Transit (2024)

Cost-Efficient GTFS-RT Pipeline

Deployed a Lambda + S3 GTFS-RT pipeline ($4/month) with 30-day history for service and on-time performance analysis. Unified GTFS static, RT, and Flex feeds with automated QA.

AWS Lambda, S3, GTFS, +2 more

Interactive visualization (2024)

GTFS Transit Analysis StoryMap

Created an interactive ArcGIS StoryMap showcasing transit network analysis and service patterns. Visualized route performance, ridership trends, and accessibility metrics.

ArcGIS Online, StoryMap, GTFS, +2 more

Deployed for VTC (2024)

Microtransit Zone Design

Designed zone-based microtransit service areas using demand analysis, travel-time modeling, and service constraints. Optimized zone boundaries for efficiency.

ArcGIS Pro, Python, Network Analysis, +2 more

88.3% accuracy (2023)

Hotel Booking Cancellation Prediction

CatBoost model achieved 88.3% accuracy predicting cancellations. Used feature importance and partial dependence to explain the impact of lead time, average daily rate (ADR), and past cancellations.

Python, CatBoost, ML, +1 more

Top 5% risk segments identified (2023)

NY Road Network Safety Analysis

Unified crashes, traffic counts, and environmental data into a geodatabase. Computed segment-level features and a composite risk score to prioritize the top 5% of segments.

ArcGIS Pro, Python, PostgreSQL, +2 more

1M tweets processed (2022)

Cyberbullying Detection (NLP)

Classified toxic content at scale using classical NLP (TF-IDF + linear models) and deep learning. Preprocessed 1M tweets with class imbalance handling and confusion-matrix analysis.

Python, NLP, TensorFlow, +2 more

EXPERIENCE

GIS Data Scientist

Volunteer Transportation Center (VTC)

St. Lawrence and Jefferson County Public Transit, NY | Dec 2023 – Present
  • Own the geospatial/data backbone: turn rider needs and ops constraints into GTFS, analytics, and tools used by planners, dispatchers, and the public.
  • Built cost-efficient AWS pipelines ($4/month) for GTFS-RT with 30-day history, reducing manual data processing by 50% and enabling real-time service analysis.
  • Unified service design across modes: aligned fixed-route, first-mile/last-mile (FMLM), and zone-based microtransit using demand, travel-time, and on-time performance (OTP) analysis for 5 counties.
  • Co-developed microtransit zones/windows and shipped LLM assistants for fixed-route operations, improving dispatch efficiency by 30%.
  • Selected speaker at MobilityData Conference, Montréal (Oct 2024); presented on cost-efficient GTFS-RT pipeline architecture.

GIS Intern

Clarkson University CEM Consulting Group (C3G)

Potsdam, NY | Jul 2023 – Dec 2023
  • Delivered GIS for Complete Streets and urban planning; updated municipal geodatabases and parcel/transport layers for 10+ municipalities.
  • Supported A2A Wildlife Connectivity Study: analyzed 1,000+ collision records using spatial joins, kernel-density, and hotspot modeling to produce planner-ready mitigation layers delivered to 3 counties.
  • Built and maintained ArcGIS Online web apps and data products for public engagement, serving 5+ planning projects.

Research Assistant

Vestibular Lab, Clarkson University

Potsdam, NY | Mar 2023 – Aug 2023
  • Streamlined VR/EMG data collection workflows to improve trial throughput by 20% while maintaining quality controls.
  • Developed R/Python/MATLAB models (incl. CNN/RNN baselines) for incident detection; improved classification accuracy by 25% compared to baseline methods.

Graduate Assistant

Clarkson University

Potsdam, NY | Aug 2022 – May 2023
  • Supported academic research and coursework in GIS and data science programs.
  • Assisted with data analysis and visualization projects for faculty research.

System Administrator & Data Analyst

Delonix Society's Baramati College

Baramati, Maharashtra, India | Jun 2021 – Jul 2022
  • Managed IT infrastructure and systems administration for college operations.
  • Performed data analysis and reporting to support administrative decision-making.
  • Maintained databases and ensured system reliability and security.

SKILLS & TOOLS

PROGRAMMING

Python: 95%
SQL: 85%
R: 75%

GEOSPATIAL

ArcGIS Pro: 92%
GeoPandas: 90%
QGIS: 88%
GTFS: 88%

CLOUD & DATA

AWS Lambda/S3: 86%
PostgreSQL: 82%

ML & ANALYTICS

scikit-learn: 88%
TensorFlow: 80%

WHAT I BRING

  • End-to-end data pipelines, from raw ingestion to production dashboards
  • Cost-conscious cloud architecture (AWS) with a track record of real cost savings
  • Geospatial analysis and visualization that tells compelling stories
  • ML models deployed in production, not just notebooks

🏆

ACHIEVEMENTS

🎤

MobilityData Conference

Presented GTFS-RT pipeline architecture

Montréal | Oct 2024
🏅

Phalanx Service Award

Outstanding service and leadership

Clarkson U. | 2023
👥

Club Leadership

Led GIS and data science organizations

Clarkson U. | 2022-23
💰

Cost Optimization

$4/month AWS pipeline, 90% cost reduction

VTC Transit | 2024

GET IN TOUCH

OPEN TO NEW OPPORTUNITIES

Looking for full-time roles, contracts, or collaborations in GIS, Data Science, Analytics, or ML Engineering.