Create a Dockerfile to ease development #66
Conversation
Additionally, document in code the required setup and have an example of use. It is by no means intended to be run in production.

The base image is ubuntu 18.04 LTS, which is the most recent version of ubuntu that can match the python/pip/virtualenv requirements. For example, ubuntu 20.04 would require Pillow 6.2 instead of 5.4.1, and virtualenv would have to be launched without the --no-site-packages option.

The setup is based on the name pgeusystem, which is used throughout, e.g. as the admin name, db name, etc.

Some basic instructions on how to build, run and use the image are added to the README in the devsetup directory, which is itself now listed in the root README file to provide visibility.
The biggest deployments of the system, being the PGEU and PGUS systems, are all running on top of Debian Buster rather than Ubuntu. I wonder if it might be a better idea to base the Dockerfile off that? In those deployments we also drive the majority of the packages off the DEB packages rather than manual installations; that's where most of the version definitions in the original requirements come from.

As for the use case -- I think it'd be good to at least have an option where it runs on top of a git checkout and uses the code there, rather than deploying the app itself into the container. That is, mount the git root from the host into the container and then run it. As I understand it from the suggested Dockerfile, it requires me to rebuild the container every time I change a file -- I'd like it to just run off the files as they are outside the container in many cases.
‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Thursday, March 18, 2021 1:13 PM, Magnus Hagander ***@***.***> wrote:
The biggest deployments of the system, being the PGEU and PGUS systems, are all running on top of Debian Buster rather than Ubuntu. I wonder if it might be a better idea to base the dockerfile off that?
Sure, that should be doable. Though since it runs on Debian Buster, should I develop on that too? No worries, I shall change it.
In those deployments we also drive the majority of the packages off the DEB packages rather than manual installations. That's where most of the version definitions in the original requirements come from.
Sure. Let's try to find them.
As for the usecase -- I think it'd be good to at least have an option where it runs on top of a git checkout and uses the code there, rather than deploys the app itself into the container. That is, mount the git root from the host into the container and then run it. As I understand it from the suggested Dockerfile, it requires me to rebuild the container every time I change a file -- I'd like it to just run off the file as it is outside the container in many cases.
To be honest, the intended usage, at least for me, was to have an easy way to create a development environment elsewhere, e.g. have a list of steps for vagrant + ansible. Not to work directly off the Dockerfile. But I guess people do work off Dockerfiles.
Yeah, that COPY is intentional and set at that early step as it will invalidate the docker cache every time any file changes, including the Dockerfile itself. That way one can be certain that it will not run against a stale installation. Usually in C-based projects one has a build directory; adding a make command takes care of the caching problem.
I can change it though, not a problem.
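To illustrate the caching trade-off being discussed, a hypothetical layer ordering (a sketch only, not the actual Dockerfile from this PR) might look like:

```dockerfile
# Sketch only: dependency layers first, sources last. Editing a source file
# then invalidates only the final COPY layer, not the package installation.
FROM debian:buster
RUN apt-get update && apt-get install -y --no-install-recommends \
        python3 python3-pip virtualenv
# Moving this COPY above the RUN would instead invalidate the cache for
# everything below it on any file change -- the behaviour described above.
COPY . /opt/pgeusystem/app
```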
… —
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub, or unsubscribe.
* The base image is now Debian Buster.
* Packages are installed from debs, not manually.
* The build stage does not verify correctness; it merely prepares packages and the system.
* The volume can be mounted in the container, as it is not copied.
* The container can run with the mounted volume, so no rebuild is needed for minor changes.
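A bind-mounted run along those lines might look roughly like this (image name and host path are assumptions, not taken from the PR):

```shell
# Mount the git checkout from the host so code changes take effect
# without rebuilding the image (hypothetical image tag and path).
docker run --rm -it --network host \
    -v "$(pwd)":/opt/pgeusystem/app \
    pgeusystem-dev
```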
I agree that it might be a better choice to run this on debian. Related ticket: #65 (for the virtualenv deprecated option)
‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Thursday, March 18, 2021 5:24 PM, Andreas Scherbaum ***@***.***> wrote:
I agree that it might be a better choice to run this on debian.
No probs. A new version is pushed and should have reached GitHub (or is reaching shortly)
… Related ticket: #65 (for the virtualenv deprecated option)
Any comments on the updated version?
tools/devsetup/Dockerfile
set -e\n\
pg_ctl -D /opt/pgeusystem/pgdata -l /opt/pgeusystem/pgdata/logfile start\n\
pushd /opt/pgeusystem/app/\n\
./tools/devsetup/dev_setup.sh localhost 5432 pgeusystem pgeusystem\n\
My inner monk is confused by the missing spaces here ;-)
I am confused by your confusion. Sorry, which missing spaces are you referring to?
The spaces before "./tools", which are fewer than the other ones. But that's just eye candy.
Got it. Will fix :)
If the above commands are successful, one can reach the index page of the app at http://localhost:8012

If the user so wishes, they can reach the database in the running container via:
Does that mean you have to log in to the container first?
If so, it would be useful to add the command for doing that.
I suggest above running the image with the --network host parameter. When that is done, and is successful, it is not necessary to log in to the container first.
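Concretely, with --network host the container's services listen directly on the host's interfaces, so something like the following should work from the host (names and port taken from the devsetup script in this PR):

```shell
# Connect from the host to PostgreSQL running inside the container.
psql -h localhost -p 5432 -U pgeusystem pgeusystem
```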
Ah, perfect.
Haven't tested the image (have a local version running). Left two comments; otherwise this looks good.
Thank you for looking!
* Fix whitespace mismatch during .sh content generation
Is it just me not reading it right, or does this re-run the devsetup script every time the container is started? Including creating a new virtualenv every time?

First, doesn't that seem like something that should be run just once? Otherwise you have to do things like create the superuser over and over again. Second, why create a virtualenv inside the docker container? That seems like doubling things.
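For what it's worth, the "run just once" concern is often handled with a marker-file guard in the entrypoint; a minimal sketch, with hypothetical paths (the wrapped command would be the dev_setup.sh invocation in the real entrypoint):

```shell
#!/bin/sh
# Run-once guard: the expensive dev setup executes only on the first
# container start; later starts see the marker file and skip it.
MARKER="${MARKER:-/opt/pgeusystem/.devsetup-done}"

run_setup_once() {
    # Run the given command only if the marker does not exist yet,
    # then drop the marker so subsequent starts skip the setup.
    if [ ! -f "$MARKER" ]; then
        "$@" && touch "$MARKER"
    fi
}
```

This assumes a writable location for the marker; a bind-mounted data directory would also make it survive container recreation.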
Also, why not expose the postgres and django ports rather than using host network? It would allow users to map them to a local port if they wish.
‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Wednesday, March 31, 2021 4:56 PM, Magnus Hagander ***@***.***> wrote:
Is it just me not reading it right, or does this re-run the devsetup script every time the container is started? Including creating a new virtualenv every time?
Only if you choose to; otherwise yes, which is also the default.
First, doesn't that seem like something that should be run just once? Otherwise you have to do things like create the superuser over and over again?
Second, why create a virtualenv inside the docker container. That seems like doubling things?
I understand your argument and it is valid. Allow me to be clear on the intention though, which is documentation as code. The first time I checked out this codebase I spent hours trying to make it run on my out-of-the-box ubuntu 20.04 machine: finding out which packages to install, which to downgrade, which to update, along with all the other assumptions made. This is nothing new to any codebase for sure, yet it can be a bit better. The only thing this Dockerfile is trying to do is to show which OS is favoured, in which version, with which packages installed, and what steps to take in order to run it. If what I would use to run locally does create a virtualenv, then so be it.

Of course there are many other ways of documenting infrastructure; this is simply low-hanging fruit.

A next step, if this or a version of it gets in, would be to create a minimal travis script that will also make certain that the Dockerfile manages to run.
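Such a CI check could be as small as the following sketch (file layout and image tag are assumptions, not part of this PR):

```yaml
# .travis.yml sketch: only verify that the image still builds.
services:
  - docker
script:
  - docker build -t pgeusystem-dev -f tools/devsetup/Dockerfile .
```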
‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Wednesday, March 31, 2021 6:34 PM, Julien Rouhaud ***@***.***> wrote:
Also, why not exposing the postgres and django ports rather than using host network? It would allow users to let them map those to a local port if they wish to.
It is by no means necessary; it is an example of one of the many, many available build options. I find it to be the most straightforward. Users who know and use docker will choose their own. Users who are not so comfortable with it will not get confused by all the additional arguments in the example.
Or that is what I thought.
I'm talking about a simple EXPOSE directive, to make it clear what ports are used. If I'm not mistaken, using …
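For reference, EXPOSE is purely documentation of the ports in use; a sketch with the ports mentioned in this thread:

```dockerfile
# Declare the ports the container uses (PostgreSQL and the Django dev server).
EXPOSE 5432 8012
```

Users could then publish them explicitly with `docker run -p 8012:8012 -p 5432:5432 ...` instead of relying on host networking.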
I'm not entirely sure what you mean here. But supporting both methods is definitely good -- though we should probably mention them in the README in that case?
I definitely agree with this in principle. But my comment is, you are now installing things twice: once in the container and then again in the virtualenv inside said container. I think it should either do system packages for what it can and virtualenv for what's needed (that's what we do in the pgeu deployments, and the only things that actually go in the virtualenv are django itself and qrencode, and django-markdown and django-markwhat, but I don't think those are needed anymore so we should remove them from requirements.txt), or we should do everything from virtualenv and not do anything as apt packages. The first option (keeping most in system packages) would certainly be the easiest, no?
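The split described above (system packages plus a thin virtualenv for the rest) might be sketched roughly as follows; the apt package names are illustrative guesses, not the actual pgeu deployment list:

```shell
# Most dependencies come from Debian packages (hypothetical selection)...
apt-get install -y python3-psycopg2 python3-pil
# ...then a virtualenv that can see the system packages, holding only
# what apt does not provide at the required version.
virtualenv --system-site-packages /opt/pgeusystem/venv
/opt/pgeusystem/venv/bin/pip install django qrencode
```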
‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Wednesday, 21 April 2021 15:36, Magnus Hagander ***@***.***> wrote:
> ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
> On Wednesday, March 31, 2021 4:56 PM, Magnus Hagander @.***> wrote: Is it just me not reading it right, or does this re-run the devsetup script every time the container is started? Including creating a new virtualenv every time?
> Only if you choose to, and yes otherwise which is also the default.
I'm not entirely sure what you mean here. But supporting both methods is definitely good -- though we should probably mention them in the README in that case?
Sure, I can add a line.
> First, doesn't that seem like something that should be run just once? Otherwise you have to do things like create the superuser over and over again? Second, why create a virtualenv inside the docker container. That seems like doubling things?
> I understand your argument and it is valid. Allow me to be clear on the intention though, which is documentation as code. The first time I checked out this codebase I spent hours trying to make it run on my out-of-the-box ubuntu 20.04 machine: finding out which packages to install, which to downgrade, which to update, along with all the other assumptions made. This is nothing new to any codebase for sure, yet it can be a bit better. The only thing this Dockerfile is trying to do is to show which OS is favoured, in which version, with which packages installed, and what steps to take in order to run it. If what I would use to run locally does create a virtualenv, then so be it. Of course there are many other ways of documenting infrastructure; this is simply low-hanging fruit. A next step, if this or a version of it gets in, would be to create a minimal travis script that will also make certain that the Dockerfile manages to run.
I definitely agree with this in principle.
But my comment is, you are now installing things twice. Once in the container and then again in the virtualenv inside said container.
It is true.
I think it should either do system packages for what it can and virtualenv for what's needed (that's what we do in the pgeu deployments, and the only things that actually go in the virtualenv are django itself and qrencode, and django-markdown and django-markwhat, but I don't think those are needed anymore so we should remove them from requirements.txt), or we should do everything from virtualenv and not do anything as apt packages.
Right. I think this is the gist of the story. I want to treat the container as the machine that things are deployed to. If we can get the docker setup to actually match that, I would be very happy.
So, since I do not have access to the deployment machines (nor do I require it), can I get the list of packages installed via the system, and the list of things in the virtualenv?
If it is later decided that things be done differently and some packages are removed, then we remove them from the docker image as well.
Does the above make sense?
The first option (keeping most in system packages) would certainly be the easiest no?
It certainly would be.
A minor side note about my commenting style and language: I have no idea how it comes across, so please just assume that I am smiling and jolly happy with any comment I receive.