Quality control

Once we have created a web archive for a site, we need people who can (more or less) read Ukrainian or Russian to review the archive and make sure the crawl didn’t miss anything major.

  • Go to the “QC” tab in the SUCHO working sheet (linked in the SUCHO Slack)
    • Column G has the link to the “WACZ file”, which is the web archive.
    • Column I has the link for reviewing the web archive
    • Column J is for QC status
    • Column K is for QC person
    • Column L is for QC notes
  • Find a row with an empty QC status (column J). Set the status to “In progress” and enter your name as the QC person (column K).
  • Take the link from column I (which should start with https://replayweb.page) and open it in your browser. It may take a few moments to load.
  • Click on the page title that appears once it loads. This will take you to the web archive.
  • Browse the site thoroughly, as if you were browsing a normal webpage. Check menus, submenus, subpages, items, PDFs, anything else that appears as you go.
  • If you see broken images, PDFs that don’t load, or internal site links that weren’t captured (a warning should pop up when something isn’t part of the archive itself), make a note of it and copy the URL from your browser. That URL shows us which subpage of the archive the problem was on.
    • If you go through the site and feel like it’s complete, change the QC status to “Done” in the SUCHO working spreadsheet and move on.
    • If a site is mostly good but there are a few things that still need to be captured, change the QC status to “Incomplete” and put your notes (and the URLs) in column L.
    • If a lot of things are missing, set the status to “Broken”.
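
When writing up QC notes, it can help to pull the relevant parts out of the browser URL you copied. The sketch below is only an illustration, not part of the SUCHO tooling. It assumes that a review link carries the WACZ location in a `source` query parameter and the archived page in a `url` parameter after the `#`. That URL shape is an assumption here, so check it against a real link from column I. The function name `describe_review_url` and the `example.org` addresses are hypothetical.

```python
from urllib.parse import urlsplit, parse_qs

# Review links in column I should start with this prefix.
REPLAY_PREFIX = "https://replayweb.page"


def describe_review_url(browser_url: str) -> dict:
    """Split a copied review URL into the WACZ source and the archived page.

    Assumption: the link looks like
    https://replayweb.page/?source=<WACZ URL>#view=pages&url=<page URL>
    Missing parts are returned as None rather than guessed.
    """
    if not browser_url.startswith(REPLAY_PREFIX):
        raise ValueError("not a replayweb.page review link: " + browser_url)

    parts = urlsplit(browser_url)
    query = parse_qs(parts.query)        # parameters before the '#'
    fragment = parse_qs(parts.fragment)  # parameters after the '#'

    return {
        "wacz": query.get("source", [None])[0],
        "page": fragment.get("url", [None])[0],
    }


if __name__ == "__main__":
    # Placeholder link; replace with the URL copied from your browser.
    copied = (
        "https://replayweb.page/?source=https%3A%2F%2Fexample.org%2Fsite.wacz"
        "#view=pages&url=https%3A%2F%2Fexample.org%2Fnews%2F"
    )
    info = describe_review_url(copied)
    print("WACZ file:", info["wacz"])
    print("Problem page:", info["page"])
```

Pasting the full browser URL into column L remains the simplest option. A helper like this only makes sense if you are reviewing many sites and want the problem pages in a readable form.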