import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from google.colab import drive
drive.mount("/content/drive")
Mounted at /content/drive
df = pd.read_excel("/content/drive/MyDrive/data_2017_4_2023.xlsx", index_col='Parameter')
df.head()
Electricity(in MU)
Parameter
2017-04-01 162.1
2017-04-02 161.3
2017-04-03 162.2
2017-04-04 164.0
2017-04-05 165.2
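The feature engineering below relies on a DatetimeIndex (df.index.month and friends). read_excel usually parses Excel dates automatically, but an explicit conversion is a cheap safeguard; a minimal sketch:
# Make sure the 'Parameter' index is a DatetimeIndex before deriving calendar features
df.index = pd.to_datetime(df.index)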
df.plot(figsize = (12,6))
<Axes: xlabel='Parameter'>
from statsmodels.tsa.seasonal import seasonal_decompose
result = seasonal_decompose(df['Electricity(in MU)'], model='additive', period=365)
# Plot to visualize
result.plot()
import numpy as np
residual_std = np.std(result.resid.dropna())  # drop NaN because trend/residual are NaN at the series edges
threshold = 2 * residual_std
# Flag points where residual exceeds threshold
df['outlier'] = np.abs(result.resid) > threshold
# Replace outliers with trend + seasonal (without noisy residual)
df['cleaned_electricity'] = df['Electricity(in MU)'] # copy original
df.loc[df['outlier'], 'cleaned_electricity'] = (result.trend + result.seasonal)[df['outlier']]
import matplotlib.pyplot as plt
plt.figure(figsize=(12, 6))
plt.plot(df['Electricity(in MU)'], label='Original Data', alpha=0.6)
plt.plot(df['cleaned_electricity'], label='Outlier Removed (Trend + Seasonal)', alpha=0.9)
plt.legend()
plt.show()
Seasonal decomposition (seasonal_decompose()) is sensitive to incomplete seasonal periods at the edges of the series. The trend is estimated with a centered moving average, which needs half a period of data on each side, so the trend (and therefore the residual) is NaN for the first and last half-period. As a result, the residual-based rule above cannot detect outliers near the start or end of the data.
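To see exactly how many edge observations the residual rule skips, a minimal check on the result object from above:
# Count residuals that are NaN at the series edges; the 2*sigma rule
# silently ignores these observations (roughly one full period in total)
print(result.resid.isna().sum(), "observations have no residual")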
Apply IQR (Interquartile Range) outlier detection specifically to the last month of 2023, as there is a visible outlier there.
# Filter December 2023
last_month = df.loc['2023-12'].copy()
# IQR Calculation
Q1 = last_month['Electricity(in MU)'].quantile(0.25)
Q3 = last_month['Electricity(in MU)'].quantile(0.75)
IQR = Q3 - Q1
lower_bound = Q1 - 1.5 * IQR
upper_bound = Q3 + 1.5 * IQR
# Detect outliers
last_month['is_outlier'] = (last_month['Electricity(in MU)'] < lower_bound) | (last_month['Electricity(in MU)'] > upper_bound)
# Replace outliers with median
median_value = last_month['Electricity(in MU)'].median()
last_month.loc[last_month['is_outlier'], 'cleaned_electricity'] = median_value
df.update(last_month)
df.tail()
Electricity(in MU) outlier cleaned_electricity
Parameter
2024-12-27 182.07 False 182.07
2024-12-28 187.75 False 187.75
2024-12-29 188.07 False 188.07
2024-12-30 195.84 False 195.84
2024-12-31 201.81 False 201.81
# Assuming 'df' has a DatetimeIndex and the 'Electricity(in MU)' column
df['month'] = df.index.month
df['dayofyear'] = df.index.dayofyear
df['dayofweek'] = df.index.dayofweek
df['is_weekend'] = (df['dayofweek'] >= 5).astype(int)
# Fourier Features for Yearly Seasonality (365 days)
df['sin_day'] = np.sin(2 * np.pi * df['dayofyear'] / 365)
df['cos_day'] = np.cos(2 * np.pi * df['dayofyear'] / 365)
df
            Electricity(in MU)  outlier  cleaned_electricity  month  dayofyear  dayofweek
Parameter
2017-04-01              162.10    False               162.10      4         91          5
2017-04-02              161.30    False               161.30      4         92          6
2017-04-03              162.20    False               162.20      4         93          0
2017-04-04              164.00    False               164.00      4         94          1
2017-04-05              165.20    False               165.20      4         95          2
...                        ...      ...                  ...    ...        ...        ...
2024-12-27              182.07    False               182.07     12        362          4
2024-12-28              187.75    False               187.75     12        363          5
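The yearly Fourier pair above encodes the annual cycle. Daily electricity demand often has a weekly cycle as well; if that holds for this data, the same trick extends directly (a sketch; the sin_week/cos_week names are illustrative, not part of the original feature set):
# Optional: Fourier features for weekly seasonality (period = 7 days)
df['sin_week'] = np.sin(2 * np.pi * df['dayofweek'] / 7)
df['cos_week'] = np.cos(2 * np.pi * df['dayofweek'] / 7)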
from sklearn.preprocessing import RobustScaler
scaler = RobustScaler()
df['scaled_electricity'] = scaler.fit_transform(df['cleaned_electricity'].values.reshape(-1, 1))
features = ['scaled_electricity', 'month', 'dayofyear', 'dayofweek', 'is_weekend', 'sin_day', 'cos_day']
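One caveat: fitting the scaler on the full series lets statistics from the test period leak into training. A stricter alternative, sketched under the assumption that the 80/20 chronological split used below is kept, is to fit on the training span only:
# Leakage-free alternative (sketch): fit the scaler on the first ~80% of the series
train_cutoff = int(0.8 * len(df))
scaler = RobustScaler().fit(df['cleaned_electricity'].values[:train_cutoff].reshape(-1, 1))
df['scaled_electricity'] = scaler.transform(df['cleaned_electricity'].values.reshape(-1, 1))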
# Convert DataFrame into supervised learning format (sliding window creation)
def create_lstm_data(df, features, time_steps=30):
X, y = [], []
for i in range(len(df) - time_steps):
X.append(df[features].iloc[i:i + time_steps].values)
y.append(df['scaled_electricity'].iloc[i + time_steps])
return np.array(X), np.array(y)
#30-day sliding window (keep your time_steps = 30)
time_steps = 30
X, y = create_lstm_data(df, features, time_steps)
print(f"X shape: {X.shape}, y shape: {y.shape}")
X shape: (2802, 30, 7), y shape: (2802,)
# Train-Test Split (80% train, 20% test)
split_index = int(0.8 * len(X))
X_train, X_test = X[:split_index], X[split_index:]
y_train, y_test = y[:split_index], y[split_index:]
print(f"Train shape: X={X_train.shape}, y={y_train.shape}")
print(f"Test shape: X={X_test.shape}, y={y_test.shape}")
Train shape: X=(2241, 30, 7), y=(2241,)
Test shape: X=(561, 30, 7), y=(561,)
from sklearn.model_selection import train_test_split
# Split train into train and validation (e.g., 80% train, 20% validation)
X_train_final, X_val, y_train_final, y_val = train_test_split(
X_train, y_train, test_size=0.2, shuffle=False # No shuffle for time series!
)
print(f"Train shape: X={X_train_final.shape}, y={y_train_final.shape}")
print(f"Validation shape: X={X_val.shape}, y={y_val.shape}")
print(f"Test shape: X={X_test.shape}, y={y_test.shape}")
Train shape: X=(1792, 30, 7), y=(1792,)
Validation shape: X=(449, 30, 7), y=(449,)
Test shape: X=(561, 30, 7), y=(561,)
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense, Dropout
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import EarlyStopping, ReduceLROnPlateau
model = Sequential()
# 1st LSTM layer (stacked)
model.add(LSTM(64, activation='tanh', return_sequences=True, input_shape=(30, 7)))
model.add(Dropout(0.3))
# 2nd LSTM layer
model.add(LSTM(32, activation='tanh'))
model.add(Dropout(0.3))
# Final dense output
model.add(Dense(1))
# Compile with Huber Loss (more robust to outliers)
model.compile(optimizer=Adam(learning_rate=0.001), loss='huber', metrics=['mae'])
# Callbacks
early_stopping = EarlyStopping(monitor='val_loss', patience=15, restore_best_weights=True)
reduce_lr = ReduceLROnPlateau(monitor='val_loss', patience=7, factor=0.5, verbose=1)
# Train
history = model.fit(X_train_final, y_train_final,
validation_data=(X_val, y_val),
epochs=50,
batch_size=32,
callbacks=[early_stopping, reduce_lr])
56/56 ━━━━━━━━━━━━━━━━━━━━ 2s 33ms/step - loss: 0.0269 - mae: 0.1786 - val_loss: 0
Epoch 23/50
56/56 ━━━━━━━━━━━━━━━━━━━━ 2s 24ms/step - loss: 0.0271 - mae: 0.1819 - val_loss: 0
Epoch 24/50
56/56 ━━━━━━━━━━━━━━━━━━━━ 4s 44ms/step - loss: 0.0252 - mae: 0.1776 - val_loss: 0
Epoch 25/50
56/56 ━━━━━━━━━━━━━━━━━━━━ 1s 24ms/step - loss: 0.0308 - mae: 0.1947 - val_loss: 0
Epoch 26/50
56/56 ━━━━━━━━━━━━━━━━━━━━ 1s 23ms/step - loss: 0.0275 - mae: 0.1803 - val_loss: 0
Epoch 27/50
56/56 ━━━━━━━━━━━━━━━━━━━━ 3s 40ms/step - loss: 0.0254 - mae: 0.1744 - val_loss: 0
Epoch 28/50
import matplotlib.pyplot as plt
plt.plot(history.history['loss'], label='Train Loss')
plt.plot(history.history['val_loss'], label='Val Loss')
plt.legend()
plt.show()
y_pred_scaled = model.predict(X_test)
y_pred_original = scaler.inverse_transform(y_pred_scaled.reshape(-1, 1)).flatten()
y_test_original = scaler.inverse_transform(y_test.reshape(-1, 1)).flatten()
18/18 ━━━━━━━━━━━━━━━━━━━━ 1s 27ms/step
import matplotlib.pyplot as plt
plt.figure(figsize=(12,6))
plt.plot(y_test_original, label='Actual', marker='o', linestyle='-')
plt.plot(y_pred_original, label='Predicted', marker='x', linestyle='--')
plt.xlabel('Time Step')
plt.ylabel('Electricity Consumption (Original Scale)')
plt.title('Actual vs Predicted on Test Data (Original Scale)')
plt.legend()
plt.show()
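Beyond the visual comparison, it is worth quantifying the fit; a minimal sketch using sklearn metrics on the inverse-transformed values:
from sklearn.metrics import mean_absolute_error, mean_squared_error

# Error metrics in the original units (MU)
mae = mean_absolute_error(y_test_original, y_pred_original)
rmse = np.sqrt(mean_squared_error(y_test_original, y_pred_original))
mape = np.mean(np.abs((y_test_original - y_pred_original) / y_test_original)) * 100
print(f"MAE: {mae:.2f} MU | RMSE: {rmse:.2f} MU | MAPE: {mape:.2f}%")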