Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dilly Dally GET data optimization with BIG DATA - Pagination vs get-everything-at-once #13

Open
astyltsvig opened this issue Feb 21, 2020 · 0 comments
Assignees

Comments

@astyltsvig
Copy link

Problem

We are currently developing a intelligent time registration system (DillyDally), where a employee of a given company can log their workhours and the system should create a nice overview of all worklogs. We are hosting our project in Google Firebase, and we are storing all our data in the NoSQL document database Firestore. At average one employee at Prolike submits 5-10 logs per day, it means the data will grow fast. If a company have 1k employees, they will generate approximately 2,6 million logs annually.

Problem 1:

The problem is we need to figure out how to fetch ALL worklogs from firestore, and the performance is the core requirement here. We need to analysis to how we can fetch 50k, 500k, 5 millions and 50 millions document smoothly.

Problem 2:*

If we dont fetch ALL data at once, how should we be able to calculate the sum of all workhours effectively?

Features

A comparison between scenarios measured in time

  • Fetching ALL data at once
  • Fetching data with pagination ( 100 at once )
    _ Your suggestion?

Benchmark

We should figure out when exactly it doesn't longer pay off to fetch all data at once compared to pagination.

Values

With this analysis, our system will now support a large-amount of entrietes, and now we can sell our system to customers.

Materials

@astyltsvig astyltsvig changed the title Dilly Dally GET data optimization - Pagination vs get-everything-at-once Dilly Dally GET data optimization with BIG DATA - Pagination vs get-everything-at-once Feb 21, 2020
@HeinLarsen HeinLarsen transferred this issue from devx-cafe/code-cafe-feb20 Mar 10, 2020
@LouBNielsen LouBNielsen self-assigned this Mar 16, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants