Related to our discussion: http://online.wsj.com/article/SB125139942345664387.html I particularly like the quote at the end: > "Digital information lasts forever -- or five years," says RAND Corp. > computer analyst Jeff Rothenberg, "whichever comes first." Tim McGeary Team Leader, Library Technology Lehigh University 610-758-4998 [log in to unmask] Google Talk: timmcgeary Yahoo IM: timmcgeary Edward Iglesias wrote: > Thanks to all of you who answered. Crowdsourcing does work if you > pick the right crowd. We have been looking at the S3 possibility but > I agree this would have to be a second copy. The policy and > institutional support comments from my tokayo > > see http://en.wiktionary.org/wiki/tocayo > > seem especially appropriate. I am going to include a link on our > staff blog to this thread as a resource. > > Thanks again, > > Edward Iglesias > > > > On Thu, Aug 27, 2009 at 8:59 PM, Edward M. > Corrado<[log in to unmask]> wrote: >> Joe Atzberger wrote: >>> On Thu, Aug 27, 2009 at 4:25 PM, Edward M. Corrado >>> <[log in to unmask]>wrote: >>> >>> >>>> Nate Vack wrote: >>>> >>>> >>>>> On Thu, Aug 27, 2009 at 1:57 PM, Ryan >>>>> Ordway<[log in to unmask]> wrote: >>>>> >>>>> >>>>> >>>>>> $213,360 over 3 years >>>>>> >>>>>> >>>>>> >>>>> If you're ONLY looking at storage costs, SATA drives in >>>>> enterprise RAID >>>>> >>>>>> systems range from about $1.00/GB to about $1.25/GB for >>>>>> online storage. >>>>>> >>>>>> >>>>> Yeah -- but if you're looking only at storage costs, you'll >>>>> have an inaccurate estimate of your costs. You've got power, >>>>> cooling, sysadmin time, and replacements for failed disks. If >>>>> you want an apples-to-apples comparison, you'll want an >>>>> offsite mirror, as well. >>>>> >>>>> I'm not saying S3 is always cost-effective -- but in our >>>>> experience, the costs of the disks themselves is dwarfed by >>>>> the costs of the related infrastructure. >>>>> >>>>> I agree that the cost of storage is only one factor. I have >>>>> to wonder >>>>> >>>> though, how much more staff time do you need for local storage >>>> than cloud storage? I don't know the answer but I'm not sure it >>>> is much more than setting up S3 storage, especially if you have >>>> a good partnership with your storage vendor. >>>> >>> >>> Support relationships, especially regarding storage are very >>> costly. When I worked at a midsize datacenter, we implemented a >>> backup solution with STORServer and tivoli. Both hardware and >>> software were considerably costly. Initial and ongoing support, >>> while indispensable was basically as much as the cost of the >>> hardware every few years. >>> >> They can be depending on what you are doing and what choices on >> software you make, but for long term preservation purposes they >> don't have to be nearly as expensive as what Ryan calculated S3 to >> cost. If you shop around you can get a quality 36GB array with 3 yr >> warranty for say $30,000 that is almost $180,000 less than S3 >> (probably much less, I'm be less than generous with my Sun >> discounts and only briefly looked at there prices). Even if we use >> the double your cost for support, it is still over $50,000 a year >> less for 3 years. Yes, we might need some expertise, but running a >> 36TB preservation storage array is not a $50,000 a year job and >> besides, what is wrong with growing local expertise? >> >> ... >>>> Yes, maybe you save on staff time patching software on your >>>> storage array, but that is not a significant amount of time - >>>> esp. since you are still going to have some local storage, and >>>> there isn't much difference in staff time in doing 2 TB vs. 20 >>>> TB. >>>> >>> >>> There's a real difference. I can get 2 TB in a single HDD, for >>> example this one for $200 at NewEgg: >>> http://www.newegg.com/Product/Product.aspx?Item=N82E16822148413 >>> <http://www.newegg.com/Product/Product.aspx?Item=N82E16822148413> >>> >>> >>> Any high school kid can install that. 20 TB requires some kind >>> of additional structure and additional expertise. >>> >> Well building a 20 TB storage device and getting it to work can >> actually be very cheap and doesn't require a PhD (just a local >> GNU/Linux geek who likes to play with hardware) if you are OK with >> a home grown solution. I wouldn't be satisfied with that, but I >> don't see how a commercial offering that adds up to $150,000 worth >> of expertise and infrastructure. >> >>> You may some time on the initial configuration, but you still >>> need to >>> >>>> configure cloud storage. Is cloud storage that much easier/less >>>> time consuming to configure than an iSCSI device? Replacement >>>> for disks would be covered under your warranty or support >>>> contract (at least I would hope you would have one). >>>> >>> >>> Warranties expire and force you into ill-timed, hardly-afforded >>> and dangerous-to-your-data upgrades. Sorta like some ILS systems >>> with which we are all familiar. >> Yes some application upgrades can cause issues, but how is that >> different if your application and/or storage is in a cloud? >> >>> The cloud doesn't necessarily stay the same, but the part you >>> care about (data in, data out) does. >>> >> How do you know they won't change their cloud models? And you don't >> even have a warranty with the cloud. They won't even guarantee they >> won't delete your data. >> >> As long as you use a common standards based method of storage, you >> won't have any more issues getting it to work than you will getting >> future application servers to work with the cloud. While I'm not a >> huge fan of NFS I've been using it for many years with no problems >> due to changes in NFS or operating systems or hardware. NFS has >> been available to the public for about 20 years. Occasionally you >> may need to migrate it from one platform or one machine to another >> but you very well need to do that with clouds as well. Maybe you >> are using S3 but for whatever reason Sun gives you a better deal >> with better terms and guarantees for using their cloud. Maybe >> Amazon drops S3. Maybe because S3 moves servers to a country that >> you are not legally allow to have your data in. Yes, you have to >> plan for migration to new platforms but I fail to see how you don't >> need to do that with the cloud. Really any major technological >> decision should have an exit plan. Preservation storage is not >> different in that and the cloud doesn't change that. >> >> Edward >> >> >>>> The power and cooling can be a savings, but in many cases the >>>> library or individual departments don't pay for electricity, so >>>> while *someone* pays the cost, it might not be the individual >>>> department. Cooling and electricity costs are an actually a >>>> great argument for tape for large-scale storage. Tape might >>>> seem old fashioned, but in many applications it by far offers >>>> the best value of long term storage per GB. >>>> >>> >>> It's true, tape is still an worthwhile option. Alternatives like >>> optical or magneto-optical media just have not kept up. >>> >>> Again, I'm not totally against the cloud and there are some >>> things I think >>> >>>> it could be very useful for, but the cloud doesn't make up for >>>> the lack of (or just bad) planning. >>>> >>> >>> Yeah, there's no system good enough to compensate for bad >>> planning and management. --Joe >>> >