Skip to content

ranvirm/scala-spark-titanic-example-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Overview

This project serves as an example of a scala-spark project using the Kaggle Titanic dataset

Usage

  1. Clone repository to local directory
  2. cd into directory
  3. Compile the project with sbt using:
sbt package
  1. Train model pipeline using:
spark-submit --class ModelTrain --master local[*] --driver-memory 4G target/scala-2.11/scalasparktitanicproject_2.11-1.0.jar
  1. Train model pipeline using:
spark-submit --class ModelPredict --master local[*] --driver-memory 4G target/scala-2.11/scalasparktitanicproject_2.11-1.0.jar

Notes

Predictions data will be saved as a csv file in the predictions directory found in project root dir

About

A Scala-Spark example project using the Kaggle Titanic dataset.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy