I agree. Kibana looks promising for log analysis and visualization...
http://rashidkpc.github.com/Kibana/index.html
On 8/30/12 9:10 PM, Lars Aronsson wrote:
> On 2012-08-31 00:02, Shearer, Timothy J wrote:
>> What we have is a webserver with 64,665 files (html, css, js, jpg, you
>> get
>> the idea) and lots of directories with subdirectories.
>>
>> The goal is to be able to conveniently take all that in in a way that
>> makes it pretty simple to see/navigate (say for a public services staff
>> member tasked with doing a survey of the old content) so that we can
>> get a
>> handle on what's there (prior to say, moving from a php+html template
>> approach to a CMS). It's about exploring the website from under the
>> hood.
>
> I'd recommend starting with the web server log files.
> Maybe a handful of those files are 95% of your traffic,
> and the rest is odd or peculiar "long tail" information.
>
>
--
Shaun D. Ellis
Digital Library Interface Developer
Firestone Library, Princeton University
voice: 609.258.1698 | [log in to unmask]
|