Someone is on the list is bound to have extra megabytes left on their
archive.org sub at the end of the period. Maybe we could have a wiki page
describing the best crawl config so nothing gets left out? Remember that
re-crawling the same content doesn't incur a cost...
...let us be heard from red core to black sky
On 13 December 2017 at 07:48, Kyle Banerjee <[log in to unmask]> wrote:
> On Tue, Dec 12, 2017 at 9:10 AM, Eric Lease Morgan <[log in to unmask]> wrote:
> > As I sit here watching my EAD files get indexed by Solr, I ask myself,
> > what degree are we — the Code4Lib community — curating our content?”
> > Seriously, our “community” generates content, and the bulk of it takes
> > three or four forms: the mailing list, the journal, the wiki, and
> > conference agendas/schedules. How “important” is this content? While it
> > very well be backed up, and while it may very well be restorable, I
> > about its intrinsic values...
> Generally speaking, if you have to wonder about the value of something, you
> already have the answer.... ;)
> But seriously, just because a theoretical use case can be imagined is not a
> good reason to dedicate resources -- this is the very definition of a
> solution looking for a problem.
> Whether or not content is formally organized, the good stuff has and will
> continue to permeate thinking/systems/processes elsewhere.