ps: the distribution of the full text availability for the sample considered was as follows: No preview: 797 (93.5%) Partial preview: 53 (6.2%) Full text: 2 (0.2%) - Godmar On Tue, May 6, 2008 at 6:09 PM, Godmar Back <[log in to unmask]> wrote: > Hi, > > to examine the usability of Google's book viewability API when lookup > is done via ISBN, we did some experiments, the results of which I'd > like to share. [1] > > For 1000 randomly drawn ISBN from 3,192,809 ISBN extracted from a > snapshot of LoC's records [2], Google Books returned results for 852 > ISBN. We then downloaded the page that was referred to in the > "info_url" parameter of the response (which is the "About" page Google > provides) for each result. > > To examine whether Google retrieved the correct book, we checked if > the Info page contained the ISBN for which we'd searched. 815 out of > 852 contained the same ISBN. 37 results referred to a different ISBN > than the one searched for. We examined the 37 results manually: 33 > referred to a different edition of the book whose ISBN was used to > search, as judged by comparing author/title information with OCLC's > xISBN service. (We compared the author/title returned by xISBN with > the author/title listed on Google's book information page.) 4 records > appeared to be misindexed. > > I found the results (85.2% recall and >99% precision, if you allow for > the ISBN substitution; with a 3.1% margin of error) surprisingly high. > > - Godmar > > [1] http://top.cs.vt.edu/~gback/gbs-accuracy-study/ > [2] http://www.archive.org/details/marc_records_scriblio_net >