Open to Senior Data Engineer roles · U.S. work authorized

Building cloud-native data platforms that scale.

I'm Krishna — a Senior Data Engineer with 15+ years in enterprise data and 7+ years architecting cloud-native platforms on Snowflake, AWS, Azure Data Factory, and Microsoft Fabric. I lead modernization programs that retire legacy SQL Server data marts, replace DataStage ETL with cloud-native ELT, and stand up secure serverless data products.

About

A modernization-minded engineer who ships.

A senior data engineer who's spent the last five years modernizing an enterprise data platform at a national bank — and the decade before that delivering enterprise ETL across healthcare, retail, insurance, and state-government clients.

  • What I do now

    Lead modernization at National Cooperative Bank — Snowflake migration from legacy SQL Server, a secure serverless financial data portal on AWS, hybrid pipelines on Azure Data Factory + Microsoft Fabric, and PySpark Lakehouse engineering on Databricks.

  • Where I've been

    15+ years across enterprise data — IBM DataStage delivery for HCSC, USAA, NY State Dept. of Health, The Children's Place, Whataburger, and Payless. Big-program reliability, big-team delivery, and big-data domains.

  • How I work

    Data products that are governed, observable, and pleasant to operate — Snowflake RBAC and resource monitors, AWS private endpoints + Secrets Manager, real-time monitoring for legacy ETL, idempotent PySpark notebooks, and SQL tuning when it matters.

  • What I'm looking for

    A senior-level role focused on cloud data modernization — Snowflake, AWS, or Microsoft Fabric. Get in touch if that sounds like your team.

Skills

Tech stack & specialties

Hands-on across cloud data platforms, ETL/ELT, dimensional modeling, and security — with deep DataStage roots.

Cloud Data Platforms

SnowflakeAWS RedshiftAWS S3 AWS GlueAthenaLambda API GatewayCognitoDynamoDB Azure Data FactoryMicrosoft FabricFabric Lakehouse

Data Engineering

ETL / ELT designData lakesLakehouse Dimensional modelingStar & Snowflake schemas SCD Type 1/2Performance tuningData quality

Programming & SQL

PythonPySparkPandas Advanced SQLPL/SQLT-SQL Snowflake SQLUNIX shell

Orchestration

Azure Data FactoryFabric Data FactoryFabric Activators IBM DataStage 11.7Apache AirflowControl-M UC4Zena

Databases

SnowflakeMS SQL ServerOracle DB2NetezzaDynamoDB

Security & Governance

AWS IAMCognitoSecrets Manager Snowflake RBACResource Monitors Solix Data MaskingData Quality Rules
Experience

Career timeline

Five years leading cloud modernization at NCB; a decade of enterprise DataStage delivery before that.

Sep 2020 — Present

National Cooperative Bank

Senior Data Engineer · Hybrid (Arlington, VA)
  • Lead Snowflake migration from on-prem SQL Server data mart, modeling RBAC and resource monitors.
  • Architected serverless financial data portal on AWS (Cognito, API Gateway, Lambda, DynamoDB) brokering FIS Global APIs.
  • Built hybrid orchestration with Azure Data Factory + Microsoft Fabric Data Factory and event-driven Activators.
  • Engineered Databricks PySpark notebooks for Lakehouse staging with archival and traceability.
Aug 2018 — Aug 2020

The Children's Place

Lead DataStage Consultant · Secaucus, NJ
  • Owned ETL design for promotions and POS data feeding the EDW and SAP-integrated POS platform.
  • Implemented XML SAP↔POS integration via Hierarchical & MQ stages into Netezza.
May 2017 — Aug 2018

NY State Department of Health

Lead DataStage Consultant · Albany, NY
  • Built parallel ETL for Medicaid enrollment / eligibility analytics powering Cognos reporting.
  • Modernized PL/SQL packages into DataStage jobs; populated Type-1 and Type-2 dimensions.
Jan 2016 — May 2017

Health Care Service Corporation (Accenture)

Lead DataStage Consultant · Chicago, IL
  • Designed parallel ETL frameworks on IBM Information Server with SCD, partitioning, master sequencers.
  • Provided 24x7 production support for clinical and member data domains.
Sep 2014 — Oct 2015

Whataburger

Lead DataStage Consultant · San Antonio, TX
  • Designed warehouse ETL with Quality Stage cleansing; led offshore delivery and DataStage→SSIS migration.
Dec 2011 — Sep 2014

USAA (Accenture)

DataStage Consultant · San Antonio, TX
  • Built EDW ingest from Oracle, Netezza, DB2, MS SQL Server across HRMS / EPM domains.
  • Migrated DataStage 8.1 → 9.1 with no production disruption.
Aug 2010 — Nov 2011

Payless Shoe Source

DataStage Developer · Topeka, KS
  • Loaded star-schema warehouse with SCDs and Quality Stage cleansing routines.
Featured Projects

Selected work

A handful of programs I've led — each one a real cloud data platform decision rather than a side project.

Cloud migration

SQL Server → Snowflake Data Mart

End-to-end migration of a legacy on-prem data mart to Snowflake. Designed virtual warehouses, resource monitors, and RBAC; translated complex T-SQL into Snowflake SQL using Streams & Tasks; rewired DataStage ETLs into bulk loads via internal named stages.

~40%
faster reports
100%
RBAC coverage
SnowflakeDataStageT-SQL → Snowflake SQLRBAC
Serverless · AWS

Secure Financial Data Portal

Architected a serverless portal that lets bank customers view their managed entities. Cognito identity, API Gateway + Lambda authorization layer, DynamoDB-backed user/account hierarchy maintained by DataStage pipelines, and AWS Secrets Manager + private endpoints for FIS Global API calls.

Multi-tier
authorization
0
long-lived secrets in code
AWS CognitoAPI GatewayLambdaDynamoDBSecrets Manager
Hybrid cloud · Microsoft Fabric

Hybrid Pipelines: ADF → Fabric DataMart

Built lift-and-shift pipelines from on-prem EDW SQL Server into Fabric Warehouse using Azure Data Factory, then drove Fabric DataMart loads via stored procedures. Event streams + Activators auto-trigger Fabric loads on DataStage completion and refresh MicroStrategy reports.

Event-driven
end-to-end batch
Hybrid
on-prem + cloud
Azure Data FactoryMicrosoft FabricActivatorsMicroStrategy
Lakehouse

Databricks PySpark Lakehouse Staging

Modular PySpark notebooks for file validation, transformation, and movement across Lakehouse staging — with automated archival of processed CSVs to ensure idempotent reruns and end-to-end traceability.

Idempotent
reruns
Modular
PySpark
DatabricksPySparkFabric Lakehouse
Reliability

Real-time DataStage Monitoring

Built a real-time monitoring pipeline and operational dashboard over the DataStage ecosystem ingesting from 15+ core banking systems and external SOAP/REST APIs — giving Operations first-time visibility into batch performance.

15+
source systems
First-time
batch visibility
DataStage 11.7SOAP / RESTOperational dashboard
Enterprise ETL

Healthcare & Medicaid Analytics Pipelines

Decade of leading IBM DataStage delivery — Medicaid enrollment / eligibility analytics for NY DOH (Cognos), clinical and member data for HCSC, and HR/EPM data for USAA. Type 1/2 dimensions, surrogate keys, Quality Stage cleansing, and DataStage 8.1 → 9.1 in production with no downtime.

10+ yrs
DataStage delivery
0
downtime upgrades
DataStagePL/SQLQuality StageCognos
Get in touch

Let's build the next platform together.

I'm available for Senior Data Engineer roles focused on cloud data modernization, Snowflake, AWS, or Microsoft Fabric. Happy to chat about a specific role, an architecture you're working through, or a migration program you're scoping.