New deployer #1284

escattone · 2020-09-19T00:46:34Z

Part of #1106

Adds new update-lambda-functions sub-command and significantly refactors the upload sub-command.

(deployer-0wsXDrGt-py3.8) bash-3.2$(maws_profile)$ deployer --help
Usage: deployer [OPTIONS] COMMAND [ARGS]...

Options:
  --debug / --no-debug
  --dry-run             Show what would be done, but don't actually do it.
                        [default: False]

  --version             Show the version and exit.
  --help                Show this message and exit.

Commands:
  update-lambda-functions
  upload

~~I still need to update the deployer README.~~ ✅

Does a full upload of the current production build of https://github.com/mdn/content as well as its redirects in about 4.5 minutes:

(deployer-0wsXDrGt-py3.8) bash-3.2$(maws_profile)$ deployer upload ../client/build/ --folder test
Warning: No content-translated-root has been specified.

Deployer (0.3.0)
Upload files from: ../client/build
Upload redirects from: /Users/rjohnson/repos/content/files
Upload into: test/ of mdn-content-dev
Total pending redirect uploads: 15,981 (18.9ms)
Total pending file uploads: 24,705 (554.4ms)
Total existing S3 objects: 0 (216.4ms)
  [▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋▋]  40686/40686  100%
Total uploaded files: 24,705 (1066.0MB)
Total uploaded redirects: 15,981
Total skipped files: 0 matched existing S3 objects
Total upload/skip time: 4m23s
Done in 4m24s.

Example document:
https://test.content.dev.mdn.mozit.cloud/en-us/docs/web/css
Example redirect:
https://test.content.dev.mdn.mozit.cloud/en-us/docs/-moz-locale-dir(ltr)

peterbe

I'm not done yet. Just dropping some early feedback.

By the way. It works!!!
I don't know what isn't working yet because all the files and redirects are in the bucket.

peterbe · 2020-09-22T20:08:08Z

deployer/src/deployer/main.py

-    "--name",
-    default=DEFAULT_NAME,
-    help=f"Name of the site (default {DEFAULT_NAME_PATTERN!r})",
+    help='Name of the S3 bucket or one of "dev", "stage", or "prod"',


This doesn't make sense to me. AWS S3 bucket names are supposed to be globally unique. These suggestions there don't make sense then because you'd never get away with calling an S3 bucket "dev".

As a matter of fact, I see no reason to have a default here. (I believe the default is DEFAULT_BUCKET_NAME = config("DEPLOYER_BUCKET_NAME", default="dev"))

The dev, stage, and prod options are bucket nicknames or shortcuts for mdn-content-dev, mdn-content-stage, and mdn-content-prod (they're converted to the real bucket name in upload.py). Just a convenience since they're the names we'll be using 99% of the time. I'll change the code to indicate that these are special nicknames.

We discussed this and agreed that it's a bit too magical to use nick names. Instead, let's keep it dumb and spell out the whole bucket name in full.

deployer/src/deployer/main.py

peterbe · 2020-09-22T20:38:33Z

deployer/src/deployer/main.py

+    dirpath = Path(directory)
+
+    if not dirpath.exists():
+        raise click.ClickException(f"{directory} does not exist")


Instead of comments, and because I wanted to back up what I was about to say, I put this together: https://github.com/escattone/yari/pull/43

Thanks for the cool validation tips! I incorporated all of your suggestions except for:

for fp in content_root.glob("**/_redirects.txt"):

I had originally used that, but it was significantly slower than

for fp in iterdir(content_root, max_depth=1): if fp.name != "_redirects.txt": continue

Since the redirect files are always under the locale, I could use (note the one star):

for fp in content_root.glob("*/_redirects.txt"):

Slower, but can you even notice the difference?

Oh. I didn't know for fp in content_root.glob("*/_redirects.txt"): was a thing even. But that actually makes sense.
But this is all nits in a sense since your existing ad-hoc solution works perfectly fine.