Skip to content

Java required for Spark unit testing lab and homework #303

Open
@Ho1yShif

Description

@Ho1yShif

Java Issue

  • In the beginning of the Unit Testing Spark Jobs lab, you show that students are ready to begin when 3 pytest tests pass
  • However, these tests all fail without a Java environment
  • Would it be possible to include a Java environment as part of the setup requirements for the Spark module? If not, I can update the README with this info too

Bug fix

  • Even after Java is installed, there appears to be an error in the do_monthly_user_site_hits_transformation function in bootcamp/materials/3-spark-fundamentals/src/jobs/monthly_user_site_hits_job.py. These SUM COALESCE statements need to include get() functions; otherwise they error
    SELECT
           month_start,
           SUM(COALESCE(get(hit_array, 0), 0)) as num_hits_first_day,
           SUM(COALESCE(get(hit_array, 1), 0)) AS num_hits_second_day,
           SUM(COALESCE(get(hit_array, 2), 0)) as num_hits_third_day
    FROM monthly_user_site_hits

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      pFad - Phonifier reborn

      Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

      Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


      Alternative Proxies:

      Alternative Proxy

      pFad Proxy

      pFad v3 Proxy

      pFad v4 Proxy