Skip to content
This repository was archived by the owner on Feb 5, 2022. It is now read-only.

CloudComputingCourse/719-p2.3-starter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

719-p2.3-starter

The starter package for 15-719/18-709 (Spring 2021) Project 2, part 3.

  • submit is used to run test for grading and submit your solution. Run it as ./submit <code-path> <test-id> <data-path>, the arguments are:
    • <code-path> is the local directory that contains your driver program and the run.sh script. It should contain nothing else.
    • <test-id> is the single letter (A, B, C) that identifies each test case described above. Please make sure the number of slave instances match the test specification or your grading will fail.
    • <data-path> is the path to your dataset on HDFS. e.g. /kdd10
  • spark_sparse_lr.py is the starter program that uses the broadcast-collect model for communicating model parameters (i.e. weights).
  • run.sh is just an example run.sh script. You can customize it as long as you believe that may help you solve the problem.
  • get_dataset.sh is the script that downloads the test case dataset and stores it in HDFS. Run it as ./get_dataset.sh <test-id>, where <test-id> is the single letter (A, B, C) that identifies the test case.

Pulling starter updates

  1. In case there're any updates in starter code, we will post patch files on Piazza. Make sure to check Piazza frequently. Once you get the .patch files, you can apply the patches on your code:
$ git apply <file>.patch
$ git diff # review changes
  1. If there are conflicts, you'll see messages showing "error: patch failed". Use cat <file>.patch to check the change and try to apply it manually. Please post on Piazza if you encounter any difficulties

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published