Skip to content
View shivareddy2002's full-sized avatar

Block or report shivareddy2002

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shivareddy2002/README.md

Header Banner


👋 Hi, I'm Lomada Siva Gangi Reddy [ 🌐 Visit My Portfolio ]

  • 🎓 B.Tech in Computer Science & Engineering (Data Science) — RGMCET (2021–2025) | CGPA: 8.3
  • 📊 Aspiring Data Professional (Data Engineer | Data Analyst | Data Scientist)
  • 💼 Skilled in Python, SQL, Snowflake, ETL, Power BI, ML, DL, Visualization
  • 📍 Based in Andhra Pradesh, open to opportunities in Hyderabad, Bengaluru, Chennai, Pune
  • 🌱 Currently exploring Advanced Deep Learning & End-to-End ML Pipelines

🛠️ Tech Skills

  • Programming Languages & Databases:PythonSQL
  • Data Engineering :SnowflakeETL PipelinesData WarehousingData ModelingCDC
  • Data Science & AI :Data PreprocessingData AnalyticsMLDL NLP LLM
  • Tools :ExcelMySQLPower BIJupyter NotebookVS Code
  • Libraries & Frameworks :NumpyPandasMatplotlibSeabornTensorFlowScikit-learnNLTK
  • Soft Skills :TeamworkProblem SolvingCommunicationQuick LearningTime Management

📌 Current Focus & Learning

  • 🚀 Building end-to-end ETL pipelines
  • ☁️ Working with Snowflake Cloud Data Platform
  • 🔄 Learning Change Data Capture (CDC) & Data Streaming
  • ⚡ Optimizing query performance & data workflows

💼 Internship Experience

🚀 Data Engineering Intern — Boolean Data Private Limited

📍 Hyderabad, India | 📅 Mar 2026 – Present

  • Building scalable ETL pipelines using Snowflake, SQL, Python
  • Designing and optimizing data warehousing solutions
  • Performing data migration and transformation
  • Implementing CDC (Change Data Capture) pipelines
  • Improving performance using query optimization & warehouse tuning

Skills: SnowflakeSQLPythonETLData PipelinesData Warehousing

🧠 Data Science Intern — CEDLEARN

📅 Oct 2025 – Dec 2025

  • Worked on real-world data analysis and AI-driven projects under industry mentorship.
  • Gained hands-on experience with Python, SQL, Machine Learning, and Streamlit for interactive analytics.
  • Developed and deployed end-to-end data visualization dashboards and text-based AI models.

🔹 Notable Projects during Internship

  • Built an end-to-end data pipeline using Snowflake to ingest raw retail data, transform it, and deliver analytics-ready datasets
  • Implemented Change Data Capture (CDC) using Snowflake Streams & Tasks for real-time incremental data processing
  • Designed a Star Schema (Fact & Dimension tables) to optimize query performance and reporting efficiency
  • Developed incremental loading (MERGE strategy) to handle inserts & updates efficiently
  • Integrated with Power BI dashboards for business insights and reporting
  • Tools: SnowflakeSQLETLData WarehousingCDCPower BI
  • Built a GRU-based RNN to generate text sequences from a custom corpus
  • Features seed text input, beam search, and interactive Streamlit web app for real-time text generation
  • Efficient lightweight architecture with fewer parameters than LSTMs for faster training
  • Tools: PythonTensorFlow/KerasStreamlitNumPyPandasMatplotlib
  • Analyzed restaurant dataset to identify patterns in ratings, food types, and pricing
  • Built visualizations for customer preferences and restaurant performance evaluation
  • Tools: • PythonPandasNumPyMatplotlibSeabornScikit-learnStreamlit
  • 📈 Delivered actionable insights on customer preferences, city-level trends, and restaurant performance

🚀 Featured Projects

  • Developed CNN + LSTM + Decision Tree + Random Forest models on bioristor IoT sensor data
  • Predicted plant water stress levels to enable smart irrigation & water conservation
  • Tools: PythonTensorFlowScikit-learnPandasMatplotlib
  • 📑 Published in Periodico di Mineralogia (DOI: 10.5281/zenodo.15047038)

  • Built a CNN-based deep learning model to classify rice grain varieties from images
  • Predicted Arborio, Basmati, Ipsala, Jasmine, Karacadag with confidence scores
  • Developed an interactive Streamlit web app for real-time predictions
  • Tools: PythonTensorFlow/KerasStreamlitNumPyMatplotlib

  • Built an end-to-end Business Intelligence dashboard to analyze credit card transactions and customer behavior
  • Delivered Weekly, Quarterly & YTD insights on revenue, transactions, activation, and delinquency
  • Designed a SQL-based data model with One-to-Many relationships for analytical reporting
  • Created DAX measures for revenue, customer segmentation, and time intelligence (WoW analysis)
  • Developed an interactive Power BI dashboard with slicers for card type, gender, income, region, and transaction mode
  • Tools: Power BISQL (MySQL)DAXData ModelingData Analytics

  • Developed an end-to-end Machine Learning & Deep Learning web application to predict used car prices
  • Implemented Random Forest & ANN models with preprocessing pipelines for accurate predictions
  • Built a professional Streamlit dashboard with INR-formatted outputs
  • Included feature importance, depreciation trends & market insights (XAI)
  • Tools: PythonScikit-learnTensorFlow/KerasPandasNumPyStreamlit

  • Built an interactive chatbot to retrieve real-time information from Wikipedia
  • Integrated Wikipedia API for knowledge extraction
  • Tools: HTMLCSS, JavaScriptAPI integration
  • 📚 Useful for educational assistance & quick knowledge queries

📑 Project Summary Table

🚀 Project 📝 Description 🛠 Tech Stack 🔗 Links
🛒 Retail Sales Data Pipeline Developed a Snowflake-based ETL pipeline with CDC and data modeling for analytics and reporting Snowflake, SQL, ETL, Data Warehousing, CDC, Power BI GitHub
🌱 Water Stress Prediction Predicted water stress using CNN+LSTM+ML on IoT data Python, TensorFlow, Sklearn Demo
🌾 Rice Type Classifier Classified rice varieties with CNN and Streamlit app Python, Keras, Streamlit Demo
🚗 Car Price Prediction AI-powered web app for used car price prediction using ML & DL models Python, Scikit-learn, TensorFlow, Streamlit Demo
📝 Text Generation GRU-based RNN for sequence generation with beam search Python, TensorFlow, Streamlit Demo
💳 Credit Card Dashboard Interactive Power BI dashboard for financial performance, customer behavior & risk analysis Power BI, SQL, DAX, Data Modeling Demo
🤖 Word Search Chatbot Interactive chatbot using Wikipedia API HTML, CSS, JS, API Demo
📊 Zomato Dashboard Data analysis & visualizations for restaurant insights Python, Pandas, Seaborn Demo


📑 Publications


🏆 Certifications

  • Snowpro Core Certification (COF-C03)— Snowflake
  • Data Science InternshipCeduraTech \
  • Data Science With AICEDLEARN
  • SQL Skill-Up Certification – GeeksforGeeks
  • Snowflake Certification Preparation (COF-C03)— Udemy
  • Data AnalystSimplilearn (Microsoft)
  • Data Science Course Completion CertificationCEDLEARN
  • AI Generalist CredentialDigital Maven
  • Machine Learning InternshipSkillDzire
  • Java Full Stack DevelopmentWipro TalentNext
  • SQL CertificationProgramming Hub
  • Big Data & HadoopedX
  • Data Science Guided PathCoding Ninjas Studio

🏅 Achievements & Extras

  • 🏆 Certificate of Merit — Naukri Campus Young Turks (Top 93.91 percentile)
  • 🎯 HackerRank: Python, Java, SQL (5⭐ each)
  • 💻 Solved 50+ coding problems on LeetCode & GeeksforGeeks
  • 📚 Published research paper in an international journal (2025)

📊 GitHub Stats & Activity


🌐 Connect with Me

🚀 Open to collaborations in Data Science, AI & ML Projects!

Thank you for visiting my GitHub! 🌟 Check out my projects, connect with me, and let's collaborate! 🤝

Pinned Loading

  1. GRU-Text-Generation GRU-Text-Generation Public

    Text Generation using a GRU-based Deep Learning model with an interactive Streamlit interface.

    Jupyter Notebook

  2. classification-and-forecasting-of-water-stress-in-tomato-plant classification-and-forecasting-of-water-stress-in-tomato-plant Public

    Streamlit web app for classification and forecasting of water stress in tomato plants using Bioristor data and deep learning models.

    Jupyter Notebook

  3. rice-grain-classifier rice-grain-classifier Public

    A CNN-powered rice grain classifier that accurately identifies different rice varieties using deep learning and Streamlit.

    Jupyter Notebook 1

  4. car-price-prediction car-price-prediction Public

    AutoPredict Pro – An end-to-end ML & Deep Learning web app for accurate used car price prediction using Random Forest & ANN, deployed with Streamlit.

    Jupyter Notebook

  5. Credit_Card_Transaction_Report Credit_Card_Transaction_Report Public

    Credit Card Transaction Analytics project using SQL and Power BI, featuring interactive dashboards, KPIs, and business insights on revenue, customers, and card usage.

  6. Zomato-Data-Analysis-Dashboard Zomato-Data-Analysis-Dashboard Public

    Interactive Zomato Data Analysis Dashboard built with Python and Streamlit, Data cleaning, EDA, visualizations, and map-based restaurant insights.

    Jupyter Notebook