Welcome to my storage!
Constructed a Standard (batch) pipeline to extract, transform and load data to the destination, created an attractive dashboard
Google Storage, Compute Instance, BigQuery, Looker Studio, Modern Data Pipeline Tool- Mage
Gathered data from previous years and share the findings of the generation, relative consumption, imports and exports. To further predict factors like revenue, investment and incentive.
HDFS, Hive, Zeppelin, Spark SQL.
Replicated the Whole Analysis in R by using ggplot and various other operations.
Analyzed dataset and created Dashboard
COPY command, S3, LAMBDA, GLUE ETL job, AWS GLUE studio, Quick sight, Athena
Load the data from S3 to Data warehouse, Data wrangling and created dashboard
Python, S3, RedShift, Star Schema, AWS Quicksight
Identified the Buy and sell state of the Stock based on up & down trends from DMA.
Jupyter Notebook, yfinance, Pandas, Matplotlib, Seaborn
Created DAG with arguments and to call function from twitter_etl.py through python operator imported from airflow
Airflow, EC2, Twitter API
A pipeline will be triggered whenever a new object will be added to S3
External stage, SNS topic, subscribe, SNOWPIPE, Snowflake
Send tweets to topic via producer and consumer will consumes the data and store it in S3 & analyzed data
EC2, SSH, KAFKA Producer and Consumer, S3, Crawler, Athena
Analyzed various measures & dimensions, Built dashboards
Excel, Tableau
A Rest API to handle various library services
H2 embedded database, Hibernate ORM, persistence, EJBs, JAX-RS, Lombok, Wild Fly10, Postman
A Spring MVC web application to maintain pet-owners data and services
Spring boot, Controllers, Thymleaf, JPA
A Spring boot Application with various API services to handle various operation to handle directory
Spring Boot, Spring Security, Thymeleaf, H2
Migrated data from MySQL to HDFS, Read the data in HIVE
MySQL, HDFS, Hive, Apache Sqoop
Used Twitter API for data source and processed data.
Twitter API, HDFS, Hive, Spark, SQL