Solr itself has an internal limit to the number of results you can return on a single page (I think it is 1000) and AFAIK always returns a paged result. For speed and memory usage over large result sets it would probably be most efficient to build in paging logic. > On Aug 31, 2016, at 10:45 PM, Tod Olson <[log in to unmask]> wrote: > > On a related note, do any of the libraries allow the user to iterate over a large result set without having to be aware of repeated calls, incrementing the start parameter, and that sort of bookkeeping? > > It seems like someone must have built an iterator to hide that when you're trying to sift through a large number of hits. > > -Tod > >> On Aug 31, 2016, at 4:09 PM, Rhoads, Joseph <[log in to unmask]> wrote: >> >> I've used several of these. I like the interface of mysolr but (as >> mentioned) it hasn't been updated in a while. >> >> pysolr is fairly up to date (v3.5 came out in May this year), and is used >> in django-haystack for the solr backend. >> https://github.com/django-haystack/pysolr >> >> Haystack itself is great if you want an ORM-like interface for solr and use >> django. >> https://github.com/django-haystack/django-haystack >> >> -Joseph >> >> >> >>> On Wed, Aug 31, 2016 at 3:42 PM, Chris Gray <[log in to unmask]> wrote: >>> >>> I haven't done much of that but you can submit documents via the API and >>> have them indexed (and processed by Tika). Once you understand how to do >>> that, you might find that you can do everything you want to do. >>> >>> An alternative would be reading the source of one of those libraries. In >>> the list you referenced, the only mention of inserting documents was for >>> sunburnt. I would be inclined to look there first, especially since it >>> mentions a pythonic interface to Solr. >>> >>> A good, and amusing, cautionary tale about overwritten Python libraries is >>> at https://www.youtube.com/watch?v=o9pEzgHorH0. >>> >>> Chris >>> >>> >>>> On 2016-08-31 03:28 PM, Eric Lease Morgan wrote: >>>> >>>> On Aug 31, 2016, at 3:25 PM, Chris Gray <[log in to unmask]> wrote: >>>> >>>> Okay, there are SO many Python libraries [1] for Solr, and I’d like to >>>>>> know which one is the most popular (not necessarily the “best”). >>>>> What do you want to do with it? >>>>> >>>>> I didn't feel the need to even look for a Python library for my needs. >>>>> I use Python to submit searches to the Solr web API and consume the results >>>>> as JSON. >>>> >>>> Good question. I want to add documents to a Solr index, and I want to >>>> query the same index. Hmmm… —Eric M. >