+1 to OpenRefine. Some extensions, like RDF Refine <http://refine.deri.ie/>, currently only work with the old Google Refine (still available here <https://code.google.com/p/google-refine/>). There's a good deal of interesting projects for OpenRefine on GitHub and GitHub Gist. Google Docs Spreadsheets also has a surprising amount of functionality, such as importXML if you're willing to get your hands dirty with regular expressions. Dave On Tue, Jul 8, 2014 at 3:12 AM, Tillman, Ruth K. (GSFC-272.0)[CADENCE GROUP ASSOC] <[log in to unmask]> wrote: > Definite cosign on Open Refine. It's intuitive and spreadsheet-like enough > that a lot of people can understand it. You can do anything from > standardizing state names you get from a patron form to normalizing > metadata keywords for a database, so I think it'd be useful even for > non-techies. > > Ruth Kitchin Tillman > Metadata Librarian, Cadence Group > NASA Goddard Space Flight Center Library, Code 272 > Greenbelt, MD 20771 > Goddard Library Repository: http://gsfcir.gsfc.nasa.gov/ > 301.286.6246 > > > -----Original Message----- > From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of > Terry Brady > Sent: Monday, July 07, 2014 1:35 PM > To: [log in to unmask] > Subject: Re: [CODE4LIB] 'automation' tools > > I learned about Open Refine <http://openrefine.org/> at the Code4Lib > conference, and it looks like it would be a great tool for normalizing > data. I worked on a few projects in the past in which this would have been > very helpful. >