I wrote an article about some of the fuss we had to deal with relating to redirecting to https.
http://go-to-hellman.blogspot.com/2013/10/internet-plumbing-mixed-redirect-chains.html
we dealt with some of the internet archive hiccups, too, but these were more to do with changing a website rather than anything having to do with https.
Another thing to remember is that it doesn't do much good to deliver html over https if you need graphics or scripts that for one reason or another require http. I expect planet.code4lib.org, and perhaps some of the other sites would be a lot of work to switch over, and they might not actually work.
Eric
On Nov 5, 2013, at 1:27 PM, Ross Singer <[log in to unmask]> wrote:
> On Tue, Nov 5, 2013 at 12:07 PM, William Denton <[log in to unmask]> wrote:
>
>>
>> (Question: Why does HTTPS complicate screen-scraping? Every decent tool
>> and library supports HTTPS, doesn't it?)
>>
>
> Birkin asked me this same question, and I realized I should clarify what I
> meant. I was mostly referring to existing screen scrapers/existing web
> sites. If you redirect every request from http to https, this will
> probably break things. I think the Open Library example that Karen
> mentioned is a good case study.
>
> And it's pretty different for a library or tool to support HTTPS and a
> specific app to be expecting it. If you follow the thread around that OL
> change, it appears there are issues with Java (as one example) arbitrarily
> consuming HTTPS (from what I understand, you need to have the cert
> locally?), but I don't know enough about it to say for certain. I think
> there would also probably be potential issues around mashups (AJAX, for
> example), but seeing as code4lib.org doesn't support CORS, not really a
> current issue. Does apply more generally to your question about library
> websites at large, though.
>
> Anyway, I agree with you that the option for both should be there. I'm not
> just not convinced that HTTPS-all-the-time is necessary for all web use
> cases.
>
> -Ross.
|