This repository contains my submission for the first assignment in NETICS Open Recruitment.
p.s. gonna write this in EN, mostly bcs I write faster in EN, but also bcs I cba writing in ID lol.
| Key | Val |
|---|---|
| Name | Faiz Muhammad Kautsar |
| Student ID (NRP) | 5054231013 |
| Deployment URL | https://netics-assignment-1.spuun.art/health |
| Status Page URL | https://hetrixtools.com/r/adbdb5c508a2763f1244b328bff6a0ce/ |
| GHCR URL | ghcr.io/spuuntries/netics-assignment-1:latest |
For this assignment, we were tasked with deploying (as in, setting up CI/CD for) a basic RESTful API with a `/health` route on GET. The example response provided in the spec goes as follows:

```json
{
  "nama": "Tunas Bimatara Chrisnanta Budiman",
  "nrp": "5025231999",
  "status": "UP",
  "timestamp": time, // Current time
  "uptime": time // Server uptime
}
```

Seems simple enough. Not much was provided in terms of constraints on implementation details, so it was really left up to the participants to implement it however they'd like.
Deployment-wise, the doc expected some "best practices", though it stayed somewhat vague on the specifics too, so I assumed those would depend on the implementation details as well.
To start things off, I implemented the API as a basic FastAPI app:

```python
import time
from datetime import datetime, timedelta

import humanize
from fastapi import FastAPI

app = FastAPI()
START_TIME = time.time()  # captured at import time, i.e. when the server starts


@app.get("/health")
async def health_check():
    uptime_seconds = int(time.time() - START_TIME)
    # humanize renders the delta as a readable string, e.g. "2 hours, 5 minutes"
    uptime_formatted = humanize.precisedelta(
        timedelta(seconds=uptime_seconds), format="%d"
    )
    return {
        "nama": "Faiz Muhammad Kautsar",
        "nrp": "5054231013",
        "status": "UP",
        "timestamp": datetime.now().isoformat(),
        "uptime": uptime_formatted,
    }
```

Then I shoved that into Uvicorn as its ASGI server.
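For reference, that boils down to a run command along these lines (the port here is just an assumption for local runs; the deployment env provides its own):

```sh
uvicorn main:app --host 0.0.0.0 --port 8000
```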
To deploy the service, since I didn't have a VPS on hand to use for this and I didn't want to use NPC's VPS lmoa, I used Render.
The CI/CD pipeline goes roughly as follows:
```mermaid
graph TD
    A[Push/PR to master] --> B[Test Job]
    B --> C{Is push to master?}
    C -->|No| D[End]
    C -->|Yes| E[Build and Push Job]
    E --> F[Deploy Job]
    F --> G[End]

    subgraph Test
        B --> B1[Checkout code]
        B1 --> B2[Setup Python]
        B2 --> B3[Install dependencies]
        B3 --> B4[Run pytest]
    end

    subgraph Build and Push
        E --> E1[Checkout code]
        E1 --> E2[Setup Docker Buildx]
        E2 --> E3[Login to GHCR]
        E3 --> E4[Extract Docker metadata]
        E4 --> E5[Build and push image]
    end

    subgraph Deploy
        F --> F1[Deploy to Render]
    end
```
You can see this implemented over at `./.github/workflows/deploy.yml`; I used GH Actions to get it to run.
The reason I used this multi-stage setup was to implement the usual CI/CD flow of unit testing, then building, then pushing. By isolating the testing stage from the build-and-push stage, the pipeline keeps prod safe: anything that fails testing never proceeds to the push and deployment stages.
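A rough sketch of what that gating looks like in Actions syntax (the real thing lives in `deploy.yml`; the action versions here are assumptions):

```yaml
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.9"
      - run: pip install -r requirements.txt
      - run: pytest

  build-and-push:
    # never starts if tests fail, and only runs on pushes to master
    needs: test
    if: github.event_name == 'push' && github.ref == 'refs/heads/master'
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: docker/setup-buildx-action@v3
      - uses: docker/login-action@v3
        with:
          registry: ghcr.io
          username: ${{ github.actor }}
          password: ${{ secrets.GITHUB_TOKEN }}
      - uses: docker/metadata-action@v5
        id: meta
        with:
          images: ghcr.io/spuuntries/netics-assignment-1
      - uses: docker/build-push-action@v5
        with:
          push: true
          tags: ${{ steps.meta.outputs.tags }}
          labels: ${{ steps.meta.outputs.labels }}

  deploy:
    # only redeploys off an image that actually got pushed;
    # the Render trigger step itself is shown further down
    needs: build-and-push
    runs-on: ubuntu-latest
```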
This "isolation" principle is kept in both the deployment and building stage. Case in point, to ensure that thorough testing is done on the entire workflow, I added another testing stage within the building stage in the Dockerfile (admittedly this is, in retrospect, a bit redundant lol, yes. However I'm still of the opinion that this thoroughness, given the tiny scope of the app, is all-in-all fine, though I would've removed it off the Dockerfile and have it in the workflow instead, otherwise they're about equivalent in utility though, since the workflow depends on the build working out anyway).
The build process in the Dockerfile goes roughly as follows:
```mermaid
graph TD
    A[Base Image: python:3.9-slim] --> B[Builder Stage]

    subgraph Builder Stage
        B --> B1[Create /build workspace]
        B1 --> B2[Copy requirements.txt]
        B2 --> B3[Install pip & dependencies]
    end

    B --> C[Testing Stage]

    subgraph Testing Stage
        C --> C1[Copy main.py]
        C1 --> C2[Copy tests/]
        C2 --> C3[Run pytest]
    end

    A --> D[Final Stage]

    subgraph Final Stage
        D --> D1[Create /app workspace]
        D1 --> D2[Copy dependencies from builder]
        D2 --> D3[Copy main.py]
        D3 --> D4[Create non-root user]
        D4 --> D5[Set up healthcheck]
        D5 --> D6[Configure environment]
    end

    style A fill:#f9f,stroke:#333
    style D fill:#9ff,stroke:#333
```
Tbh, the way I set this up didn't really assume I'd be deploying to Render (hence that HEALTHCHECK), so it's for the most part deployment-environment-agnostic. I'll try breaking it down. To build, it:

- sets up a builder image from `python:3.9-slim`,
- creates a `/build` directory,
- copies the requirements file,
- installs the deps.

Then, it moves over to the testing stage, with the same image:

- copies the `main.py`,
- copies the tests,
- runs the tests via pytest.

Finally, it sets up the deployment image:

- gets a new `python:3.9-slim` image,
- copies the deps folder(s) from the builder image,
- copies the `main.py` script,
- creates a non-root user,
- switches to it,
- sets up a healthcheck,
- sets up the `CMD`, which acts as the entrypoint.
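Condensed into Dockerfile form, the whole thing looks roughly like this (a sketch, not the actual file; the exact paths, flags, and the pytest install are assumptions):

```dockerfile
# --- builder: isolated env that just installs deps ---
FROM python:3.9-slim AS builder
WORKDIR /build
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# --- testing: reuses the builder env; a pytest failure aborts the whole build ---
FROM builder AS testing
COPY main.py .
COPY tests/ tests/
# assuming any test deps beyond pytest are already in requirements.txt
RUN pip install --no-cache-dir pytest && pytest

# --- final: fresh base image, only deps + app come along ---
FROM python:3.9-slim
WORKDIR /app
COPY --from=builder /usr/local/lib/python3.9/site-packages /usr/local/lib/python3.9/site-packages
COPY --from=builder /usr/local/bin /usr/local/bin
COPY main.py .

# drop privileges before running anything
RUN adduser --disabled-password --gecos "" appuser
USER appuser

# poll the route itself so Docker can flag an unhealthy container
HEALTHCHECK --interval=30s --timeout=3s \
    CMD python -c "import urllib.request; urllib.request.urlopen('http://localhost:8000/health')"

CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "8000"]
```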
When it comes to the deployment stage in the workflow, it's structured this way so that, again, there's that separation of concerns with respect to each stage in the integration process. Say the build worked out, but Render's environment went all broke on it: since the built image artifacts are already kept on the GHCR repo, we can revert to a previous build.
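For example, a manual rollback would hypothetically just be a retag away (`<known-good-tag>` is a placeholder for whatever older tag is sitting on the GHCR repo):

```sh
# hypothetical rollback: promote a known-good older image back to :latest
docker pull ghcr.io/spuuntries/netics-assignment-1:<known-good-tag>
docker tag ghcr.io/spuuntries/netics-assignment-1:<known-good-tag> \
           ghcr.io/spuuntries/netics-assignment-1:latest
docker push ghcr.io/spuuntries/netics-assignment-1:latest
# ...then trigger a Render redeploy, which pulls :latest again
```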
Deploying over to Render works out like this:

- the image is built,
- pushed to GHCR,
- then Render's API is accessed to trigger a redeployment with the `latest` image.
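The trigger step itself is tiny; roughly something like this (the secret names here are my own assumptions, the real step lives in `deploy.yml`):

```yaml
- name: Trigger Render redeploy
  # POSTing to the service's deploys endpoint kicks off a new deploy,
  # which pulls ghcr.io/...:latest again
  run: |
    curl -fsS -X POST \
      -H "Authorization: Bearer ${{ secrets.RENDER_API_KEY }}" \
      "https://api.render.com/v1/services/${{ secrets.RENDER_SERVICE_ID }}/deploys"
```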
Since Render's free tier has a sleep-on-idle thing like other hosting providers (e.g. Vercel or Netlify or Glitch or Replit or smth), I had to set up a pinger service to ensure it doesn't sleep. The first thing that came to mind was UptimeRobot, since I'd used it for a previous project, but (either I forgot or just hadn't heard about this) they now only support HEAD requests for uptime checks. I could technically just make the app respond to HEADs, probably, but I cba, so instead we look for another.
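For the record, had I bothered, explicitly answering HEADs in FastAPI would've hypothetically been a one-decorator affair, something like:

```python
from fastapi import Response

# hypothetical: a HEAD handler on the same path, enough for HEAD-only pingers
@app.head("/health")
async def health_check_head():
    return Response(status_code=200)  # HEAD replies carry headers only, no body
```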
In the end, I used HetrixTools to get it going. As far as I can tell, it's been running pretty solid, and as a bonus, I got a status page to go along with it :)) Albeit not a really nice looking one, but it's p good for literally free, so.
With implementation and deployment of the API out of the way, let's get to the "best practices" I implemented.
Tbf, this isn't exactly "unit" testing, since the code is practically just that one route lol, but I did implement some testing via pytest here:
```python
from datetime import datetime

import pytest
from fastapi.testclient import TestClient

from main import app

client = TestClient(app)


def test_health_endpoint():
    response = client.get("/health")
    assert response.status_code == 200

    data = response.json()

    # all expected fields are present
    assert "nama" in data
    assert "nrp" in data
    assert "status" in data
    assert "timestamp" in data
    assert "uptime" in data

    # field values match the spec
    assert data["nama"] == "Faiz Muhammad Kautsar"
    assert data["nrp"] == "5054231013"
    assert data["status"] == "UP"

    # timestamp must parse as ISO 8601
    try:
        datetime.fromisoformat(data["timestamp"])
    except ValueError:
        pytest.fail("Timestamp is not in valid ISO format")

    print("All tests passed!")
```

Okay, to be fair, this is something I've already gone over up there, but the idea with my submission is to keep as much separation of concerns within the integration pipeline as possible.
Another example here: looking at that Dockerfile I implemented, the build runs through two isolated environments.
# "Building"
FROM python:3.9-slim AS builder
# ... snip ...
# Testing
FROM builder AS testing
# ... snip ...
# Deploy
FROM python:3.9-slim
# ... snip ...This is to keep the "deployment" environment clean of the "building" environment, if something weird was modified during the building or testing process of the Dockerfile, the deployment image stays clean.
To ensure the app's deployed "securely", I set up a low-privilege user, applying the principle of least privilege to the deployment. This way, if the app ever was compromised (I'd be surprised if it was tbh aksndals), the access scope stays relatively limited to that low-privilege user.
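The relevant lines in the final stage boil down to something like this (echoing the sketch earlier; the user name is just what I'd call it in a sketch):

```dockerfile
# create an unprivileged user and hand the app dir over to it
RUN adduser --disabled-password --gecos "" appuser \
    && chown -R appuser:appuser /app

# everything from here on, including CMD, runs as the low-privilege user
USER appuser
```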
Technically out of scope, but I was pretty happy with the bonus of monitoring w/ Hetrix lol, so, um, well, the deployment now has pretty good incident response set up: it has alerts for when it goes down and a status page.