If you need to archive a Facebook account now (like, right now), what should you do?| anjackson.net
Public version of the midpoint review report for the Registries of Good Practice project.| anjackson.net
About FOSDEM 2025, and my rejected Main Track talk proposal.| anjackson.net
Using static website tools to make a new community resource.| anjackson.net
Notes and articles on digital preservation| anjackson.net
A two-year project experimenting with ways of supporting digital preservation practices and the information sources we rely on.| anjackson.net
Billie Eilish clearly has concerns about migrating content off of old media.| anjackson.net
Following my brief post on Finding Formats in WikiData, Ross Spencer posted a response that was full of useful information.| anjackson.net
ADDENDUM: Please note that this was a bit rushed, so there’s a follow-up post that provides some more context, but you should probably still read this one first.| anjackson.net
I was the Technical Lead for the UK Web Archive at the British Library for twelve years.| anjackson.net
Every so often, there’s some publication or panel that worries about all that data out there.| anjackson.net
I’m not sure publishing For The Love was really a good idea.| anjackson.net
Yesterday I was reading about how I should Get Ready for the Great Digital Preservation Bake Off at iPRES 2024!| anjackson.net
I have a theory about books: people love them.| anjackson.net
Having pulled together an initial version of the iPRES proceedings dataset, the next question is: How can we make this data easier to access and use?| anjackson.net
As part of the Registries of Good Practice project, we wanted to start by seeing if we could make it easier to find the records of research and practice that make up the iPRES digital preservation conference proceedings.| anjackson.net
This is a very silly thing I wrote many years ago.| anjackson.net
This is my /now page.| anjackson.net
By Andy Jackson, Web Archiving Technical Lead at the British Library (until January 2024)| anjackson.net
After a small delay, we were finally able to formally launch the Registries of Good Practice project!| anjackson.net
On Monday and Tuesday, I was still working at the British Library.| anjackson.net
About ten years ago I realised that, for a personal website, Drupal was far too much work to maintain.| anjackson.net
First published as this UK Web Archive blog post.| anjackson.net
First published as this UK Web Archive blog post| anjackson.net
Carefully moving files between storage systems is a critical part of digital preservation.| anjackson.net
First published on the UKWA blog…| anjackson.net
Recently, the Library of Congress’s excellent Web Archive Team sent out a request to gather information about how web archives cope with large websites.| anjackson.net
First publicised in this UK Web Archive blog post.| anjackson.net
As the ZIP scanning issue keeps getting updated, I realised I made some errors in my analysis of how DROID works.| anjackson.net
Following on from the previous post, I was experimenting with Siegfried and found it to be even faster than I was expecting!| anjackson.net
In the last few days, I’ve been going through the process of updating my Nanite wrapper for DROID, which I built to make it easier to re-use DROID’s identification engine in other contexts – especially in large-scale Hadoop jobs where we want to process every record in our WARCs.| anjackson.net
A recent comment from the #DHNB2023 conference caught my eye…| anjackson.net
Usually we don’t let search engines index web archives.| anjackson.net
This is a summary of what’s been going on since the update at the start of the autumn.| anjackson.net
I’m not really into new year resolutions, but I would like to get back into blogging.| anjackson.net
Recently, for no particular reason I’m sure, there seems to have been a renewed interest in more distributed and community-oriented ways of finding good stuff on the web.| anjackson.net
This is a summary of what’s been going on since the update at the start of the summer.| anjackson.net
Following on from the last quarterly update, we’ve been able to make some good progress despite being understaffed during this period.| anjackson.net
This is a summary of what’s been going on since the last update, at the start of the year.| anjackson.net
During the last quarter of 2021, the technical services that make up the web archive underwent a lot of changes behind the scenes.| anjackson.net
Abstract: Under Legal Deposit, our crawl capacity needs grew from a few hundred time-limited snapshot crawls to the continuous crawling of hundreds of sites every day, plus annual domain crawling.| anjackson.net
I love a digital preservation mystery, and this one started with a question from @joe on digipres.| anjackson.net
Today is the inaugural International Digital Preservation Day, and as a small contribution to that excellent global effort I thought I’d write about the current state of the open source tools that enable access to web archives.| anjackson.net
Originally published on the UK Web Archive blog on the 10th of November 2017.| anjackson.net
Before I revisit the ideas explored in the first post in the blog series I need to go back to the start of this story…| anjackson.net
Abstract: As an increasing number of government and other publications move towards online-only publication, we are forced to move our traditional Legal Deposit processes based on cataloging printed media.| anjackson.net
This is the script for the introduction I gave as part of a ‘Digital Conversations at the BL’ panel event: Web Archives: truth, lies and politics in the 21st century on Wednesday 14th of June, as part of Web Archiving Week 2017.| anjackson.net
Abstract: The British Library has a long tradition of preserving the heritage of the United Kingdom, and processes for handling and cataloguing print-based media are deeply ingrained in the organisation’s structure and thinking.| anjackson.net
Originally published on the UK Web Archive blog on the 8th of June 2017.| anjackson.net
Following my previous post, a tweet from Raffaele Messuti led me to this quote:| anjackson.net
So what was going on in our little experiment in data destruction?| anjackson.net
Following my proposed experiment in data destruction, a few kind readers tried it out and let me know what happened.| anjackson.net
Let’s start with an experiment…| anjackson.net
I find working in digital preservation fascinating.| anjackson.net
I came to work on digital preservation through the PLANETS project, and later the SCAPE project (for the first year) before moving over to web archiving.| anjackson.net
Four years ago, during the 2012 IIPC General Assembly, we came together to discuss the recent and upcoming challenges to web archiving in the Future of the Web Workshop (see also this related coverage on David Rosenthal’s blog).| anjackson.net
Originally published on the UK Web Archive blog on the 15th of February 2016.| anjackson.net
Originally published on the UK Web Archive blog on the 20th November 2015.| anjackson.net
A few months ago, a colleague suggested that we should come up with ways of helping people learn about the main stages of web archiving, and to help them understand some of the more common technical terminology.| anjackson.net
On the first day of the IIPC GA 2015, the morning keynote was Digital Vellum: Interacting with Digital Objects Over Centuries, presented by Vint Cerf and Mahadev Satyanarayanan.| anjackson.net
As a computational physicist working in a library, my background and training is quite different to the curators and researchers I now work with.| anjackson.net
As published on the UK Web Archive blog.| anjackson.net
Following Vint Cerf’s talk at AAAS, the “Digital Dark Age” is in the news again (see DSHR’s blog for a good summary, or one of the ~200 other news articles about it!)| anjackson.net
As published on the UK Web Archive blog.| anjackson.net
As published on the UK Web Archive blog.| anjackson.net
First published on the UK Web Archive blog.| anjackson.net
First published on the UK Web Archive blog.| anjackson.net
A new OPF blog entry: User-Driven Digital Preservation.| anjackson.net
First published on the UK Web Archive blog.| anjackson.net
First published on the UK Web Archive blog.| anjackson.net
First published on the UK Web Archive blog.| anjackson.net
Big UK Domain Data for the Arts and Humanities project to build a prototype historical search engine| anjackson.net
Community-owned digital preservation resources| anjackson.net
Tools for analysing and indexing web archives| anjackson.net
A new OPF blog entry: Digital Preservation War Stories.| anjackson.net
A new OPF blog entry: The Registries We Need.| anjackson.net
Notes and experiments on digital preservation.| anjackson.net
A new OPF blog entry: Analysing the formats in the UK Web Archive.| anjackson.net
A new OPF blog entry: Biodiversity and the registry ecosystem.| anjackson.net
The Analytical Access to the Dark Domain Archive (AADDA) Project.| anjackson.net
A new OPF blog entry: A Format Registry for SCAPE.| anjackson.net
A new OPF blog entry: What do we mean by format?| anjackson.net
A new OPF blog entry: OPF Year 1: Visualisation of development activity.| anjackson.net
A new OPF blog entry: Cargo Cult Standards.| anjackson.net
A new OPF blog entry: Economical Access via Normalisation.| anjackson.net
New OPF Blog: Building A Collaborative Format Registry Editor.| anjackson.net
A new OPF blog entry: Format Obsolescence and Sustainable Access.| anjackson.net
A new OPF blog entry: Is obsolescence overrated?| anjackson.net
Thanks to our web archiving team (who lead the UK Web Archive project), I was given a day of training on using Hadoop today.| anjackson.net
A new OPF blog entry: Breaking Down The Format Registry.| anjackson.net
A new OPF blog entry: In the room.| anjackson.net
As well as blogging about digital preservation here, I’ve also got a blog on the Open Planets Foundation website where I’ll post about OPF issues.| anjackson.net
About: Tools for turning a collection of XML metadata (MODS, METS, EAD) and digital assets into an online digital library with a minimum of effort.| anjackson.net