
Task 4

The document outlines a Python script for loading data, creating target and predictor variables, and training a Random Forest Regressor model using cross-validation. It includes functions for data loading, target creation, and model training with performance metrics calculated for each fold. The script utilizes libraries such as pandas and scikit-learn for data manipulation and machine learning tasks.

Uploaded by rs9084156


import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error
from sklearn.preprocessing import StandardScaler

# Load data
def load_data(path: str = "/path/to/csv/"):
    """
    This function takes a path string to a CSV file and loads it into
    a Pandas DataFrame.

    :param path (optional): str, relative path of the CSV file

    :return df: pd.DataFrame
    """
    df = pd.read_csv(path)
    # Drop the unnamed index column if it is present
    df.drop(columns=["Unnamed: 0"], inplace=True, errors="ignore")
    return df

# Create target variable and predictor variables
def create_target_and_predictors(
    data: pd.DataFrame = None,
    target: str = "estimated_stock_pct"
):
    """
    This function takes in a Pandas DataFrame and splits the columns
    into a target column and a set of predictor variables, i.e. X & y.
    These two splits of the data will be used to train a supervised
    machine learning model.

    :param data: pd.DataFrame, dataframe containing data for the model
    :param target: str (optional), target variable that you want to
                   predict

    :return X: pd.DataFrame
            y: pd.Series
    """

    # Check to see if the target variable is present in the data
    if target not in data.columns:
        raise Exception(f"Target: {target} is not present in the data")

    X = data.drop(columns=[target])
    y = data[target]
    return X, y

# Train algorithm
# NOTE: K and SPLIT are used below but are never defined in this script.
# The values here are assumed placeholders, not taken from the source.
K = 10        # number of train/test rounds (assumed)
SPLIT = 0.75  # fraction of rows used for training (assumed)


def train_algorithm_with_cross_validation(
    X: pd.DataFrame = None,
    y: pd.Series = None
):
    """
    This function takes the predictor and target variables and
    trains a Random Forest Regressor model across K folds. Using
    cross-validation, performance metrics will be output for each
    fold during training.

    :param X: pd.DataFrame, predictor variables
    :param y: pd.Series, target variable

    :return: None, the MAE of each fold and the average are printed
    """

    # Create a list that will store the MAE of each fold
    accuracy = []

    # Enter a loop to run K folds of cross-validation
    for fold in range(0, K):

        # Instantiate algorithm and scaler
        model = RandomForestRegressor()
        scaler = StandardScaler()

        # Create training and test samples. The seed varies per fold,
        # because a fixed random_state=42 yields the same split every time.
        X_train, X_test, y_train, y_test = train_test_split(
            X, y, train_size=SPLIT, random_state=fold
        )

        # Scale X data; we scale the data because it helps the algorithm
        # to converge and helps the algorithm to not be greedy with
        # large values. The scaler is fitted on the training sample only.
        scaler.fit(X_train)
        X_train = scaler.transform(X_train)
        X_test = scaler.transform(X_test)

        # Train model
        trained_model = model.fit(X_train, y_train)

        # Generate predictions on test sample
        y_pred = trained_model.predict(X_test)

        # Compute accuracy, using mean absolute error
        mae = mean_absolute_error(y_true=y_test, y_pred=y_pred)
        accuracy.append(mae)
        print(f"Fold {fold + 1}: MAE = {mae:.3f}")

    # Finish by computing the average MAE across all folds
    print(f"Average MAE: {(sum(accuracy) / len(accuracy)):.3f}")
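
The training function above evaluates the model on repeated random train/test splits. For comparison, here is a minimal sketch of a strict K-fold variant, in which every row is used for testing exactly once. It is not part of the original script: the helper name k_fold_mae, the fold count, the seed, the number of trees, and the synthetic data with its column names (other than estimated_stock_pct) are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import KFold
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def k_fold_mae(X: pd.DataFrame, y: pd.Series, n_splits: int = 5, seed: int = 0) -> list:
    """Return the MAE of each fold (hypothetical helper, not from the script)."""
    folds = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    scores = []
    for train_idx, test_idx in folds.split(X):
        # A fresh pipeline per fold: the scaler is fitted on training rows only
        model = make_pipeline(
            StandardScaler(),
            RandomForestRegressor(n_estimators=50, random_state=seed),
        )
        model.fit(X.iloc[train_idx], y.iloc[train_idx])
        y_pred = model.predict(X.iloc[test_idx])
        scores.append(mean_absolute_error(y.iloc[test_idx], y_pred))
    return scores


# Synthetic stand-in data; the predictor column names are illustrative only
rng = np.random.default_rng(0)
demo = pd.DataFrame({
    "quantity": rng.normal(size=200),
    "temperature": rng.normal(size=200),
})
demo["estimated_stock_pct"] = (
    0.5 + 0.1 * demo["quantity"] + rng.normal(scale=0.01, size=200)
)

X_demo = demo.drop(columns=["estimated_stock_pct"])
y_demo = demo["estimated_stock_pct"]

fold_scores = k_fold_mae(X_demo, y_demo)
for i, mae in enumerate(fold_scores, start=1):
    print(f"Fold {i}: MAE = {mae:.3f}")
print(f"Average MAE: {sum(fold_scores) / len(fold_scores):.3f}")
```

Wrapping the scaler and the model in one pipeline keeps the scaling statistics from leaking out of the test rows, which mirrors the fit-on-train, transform-on-test order used in the script.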
