Print

Print


This conversation is heading into the "draining the swamp" category.

Bill Denton started this thread with the suggestion that we use HTTPS everywhere. He did not make a specific case for it. I am just guessing that an argument for going that route would include security.

Regardless of whether this is a good idea, or whether there is a compelling reason for doing it, it seems to me that the possibility of its making it difficult for older scraping tools to scrape the site does not seem like a compelling reason not to do it.

The cost issue, on the other hand, would be a more compelling consideration.

Thanks,

Cary

On Nov 6, 2013, at 6:17 PM, Ross Singer <[log in to unmask]> wrote:

> How is security getting thrown under the bus?
> 
> -Ross.
> 
> On Wednesday, November 6, 2013, Cary Gordon wrote:
> 
>> It sounds like we are willing to throw security under the bus for an edge
>> case, although I am sure that I am missing some subtlety
>> 
>> Cary
>> 
>> On Nov 5, 2013, at 10:27 AM, Ross Singer <[log in to unmask]<javascript:;>>
>> wrote:
>> 
>>> On Tue, Nov 5, 2013 at 12:07 PM, William Denton <[log in to unmask]<javascript:;>>
>> wrote:
>>> 
>>>> 
>>>> (Question:  Why does HTTPS complicate screen-scraping?  Every decent
>> tool
>>>> and library supports HTTPS, doesn't it?)
>>>> 
>>> 
>>> Birkin asked me this same question, and I realized I should clarify what
>> I
>>> meant.  I was mostly referring to existing screen scrapers/existing web
>>> sites.  If you redirect every request from http to https, this will
>>> probably break things.  I think the Open Library example that Karen
>>> mentioned is a good case study.
>>> 
>>> And it's pretty different for a library or tool to support HTTPS and a
>>> specific app to be expecting it.  If you follow the thread around that OL
>>> change, it appears there are issues with Java (as one example)
>> arbitrarily
>>> consuming HTTPS (from what I understand, you need to have the cert
>>> locally?), but I don't know enough about it to say for certain.  I think
>>> there would also probably be potential issues around mashups (AJAX, for
>>> example), but seeing as code4lib.org doesn't support CORS, not really a
>>> current issue.  Does apply more generally to your question about library
>>> websites at large, though.
>>> 
>>> Anyway, I agree with you that the option for both should be there.  I'm
>> not
>>> just not convinced that HTTPS-all-the-time is necessary for all web use
>>> cases.
>>> 
>>> -Ross.
>>