Hello, I'm

Visakh Vijay O

Software Development Engineer · Backend Systems & Data Engineering

Building distributed systems and high-throughput data pipelines.

Experience

Software Development Engineer

UST

Aug 2024 – Present

Building distributed data pipelines and backend systems for enterprise-scale applications with a focus on performance, reliability, and scalability.

  • Contributed to a large-scale replenishment integration system for a U.S. retail chain (2000+ stores), ensuring reliable data synchronization across ERP systems
  • Refactored batch pipelines into Kafka-based real-time streaming workflows, improving data freshness and system maintainability
  • Improved Spark job performance by 35% through optimized joins, caching, and partitioning strategies
  • Built a multi-threaded Python data migration client reducing execution time by 33%
  • Implemented fault-tolerant retry and alerting mechanisms, eliminating recurring job failures and reducing manual intervention
  • Developed event-driven architectures and backend services for high-throughput data processing
PythonPySparkApache SparkKafkaAzure DatabricksAzure Data FactorySQLDocker

Featured Projects

View all →

Geomys: Distributed In-memory Key-Value Store

Geomys is a distributed in-memory key-value store that supports leader-follower replication, persistence, and multi-node clustering. It ensures high availability and eventual consistency across nodes using gRPC-based data replications.

GolanggRPCDocker

Decentralized Online Social Network

A Decentralized Online Social Network built with Ethereum and IPFS

BlockchainIPFSSolidityEtherum

Swiggy Restaurant Data Pipeline

This project develops a data engineering pipeline to analyze restaurant data from various cities on the Swiggy platform. Using PySpark, Spark SQL, and Azure Data Factory, the data is processed and transformed to generate insights on ratings, cuisines, and trends, presented through databricks dashboard.

PySparkAzure Data FactoryDatabricksPython

Skills

Distributed Systems

gRPCLeader-Follower ReplicationEventual ConsistencyFault ToleranceDistributed CachingDocker

Data Engineering

PySparkSpark SQLKafkaDelta LakeAzure Data FactoryDatabricksETL/ELTSCD Type 2

Backend Development

GoPythonJavaNode.jsExpress.jsREST APIsSQL

Cloud & Platforms

AzureCosmos DBBlob StorageBigQueryMySQLMongoDB

Frontend & Tools

ReactNext.jsTailwind CSSGitLinuxBash

Recent Posts

View all →

Test post functionality

This is a test blog post

2020-01-02

Get in Touch

I'm always interested in new opportunities and interesting projects. Feel free to reach out!