You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+9-14Lines changed: 9 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,20 +9,16 @@ Now your goal will be to make the SELECT queries fast.
9
9
10
10
For this assignment, we will work with 10 days of twitter data, about 31 million tweets.
11
11
This is enough data that indexes will dramatically improve query times,
12
-
but you won't have to wait hours/days to create each index and see if it works correctly.
12
+
but you won't have to wait days to create each index and see if it works correctly.
13
13
14
-
**due date:**~~Thursday 18 April~~
14
+
> **WARNING:**
15
+
> This assignment can put lots of load on the lambda server.
16
+
> Depending on the lambda server's current load, you may have to wait up to 12 hours for individual CREATE INDEX commands to run.
17
+
> If you wait until the last minute,
18
+
> you are very likely to not finish on time.
19
+
> THERE WILL BE NO EXTENSIONS FOR THIS ASSIGNMENT.
15
20
16
-
1. graduating students: ~~Sunday 21 April~~ Tuesday 30 April
17
-
18
-
I recommend it to be submitted before your final exam, so I can give you your final grade during the exam.
19
-
20
-
1. non-graduating students: Tuesday 30 April
21
-
22
-
This assignment can put lots of load on the lambda server.
23
-
My motivation for extending the due date for non-graduating students, is to have less contention for resources for the graduating students.
24
-
25
-
## Step 0: Prepare the repo/docker
21
+
## Step 0: Setup
26
22
27
23
1. Fork this repo, and clone your fork onto the lambda server.
28
24
@@ -43,7 +39,6 @@ but you won't have to wait hours/days to create each index and see if it works c
43
39
44
40
1. Notice that the `docker-compose.yml` file uses a [bind mount](https://docs.docker.com/storage/bind-mounts/) into your `$HOME/bigdata` directory whereas all of our previous assignments stored data into a [named volume](https://docs.docker.com/storage/volumes/).
45
41
46
-
47
42
This is necessary because in this assignment, you will be creating approximately 100GB worth of databases.
48
43
This won't fit in your home folder on the NVME drive (10G limit), and so you must put it into the HDD drives (250G limit).
49
44
@@ -91,7 +86,7 @@ but you won't have to wait hours/days to create each index and see if it works c
91
86
92
87
> **Hint:**
93
88
> If you need help deleting the data for whatever reason,
94
-
> let me know and I can delete it for you as a root user.
89
+
> let me (or the TA) know and we can delete it for you as a root user.
0 commit comments