I'm very interested too. (Though, I don't know if you're saying this would
be at a Code4Lib conference, if so I won't be there. But if you're thinking
of doing this virtually I'd love to).
Brad
On Sat, Nov 11, 2017 at 3:16 PM, Dhanushka Samarakoon <[log in to unmask]>
wrote:
> I'm interested too.
>
> On Thu, Nov 9, 2017 at 11:45 AM, Haitz, Lisa (haitzlm) <
> [log in to unmask]> wrote:
>
> > I love it!
> >
> > On 11/9/17, 1:13 PM, "Code for Libraries on behalf of Eric Lease Morgan"
> <
> > [log in to unmask] on behalf of [log in to unmask]> wrote:
> >
> > I’m thinking about a hands-on workshop on natural language processing
> > & text mining, below, and your feedback is desired. —ELM
> >
> >
> > Natural language processing & text mining using freely available
> > tools: "No programming necessary"
> >
> > This text outlines a hands-on natural language & text mining
> workshop.
> >
> > It is possible to do simple & rudimentary natural language processing
> > & text mining with a set of freely available tools. No programming is
> > necessary. This workshop facilitates hands-on exercises demonstrating how
> > this can be done. By participating in this workshop, students &
> researchers
> > will be able to:
> >
> > * identify patterns, anomalies, and trends in their texts
> > * practice both "distant" and "scalable" reading
> > * enhance & complement their ability to do "close" reading
> > * use & understand a corpus of poetry or prose at scale
> >
> > Activities in the workshop include:
> >
> > * learning what natural language processing is, and why you should
> > care
> > * articulating a research question
> > * creating a corpus
> > * creating a plain text version of a corpus with Tika [1]
> > * using Voyant Tools to do some "distant" reading" [2]
> > * using a concordance (AntConc) to facilitate searching keywords in
> > context [3]
> > * creating a simple word list with a text editor
> > * cleaning & analyzing word lists with OpenRefine [4]
> > * charting & graphing word lists with Tableau Public [5]
> > * increasing meaning by extracting parts-of-speech with the
> Standford
> > POS Tagger [6]
> > * increasing meaning some by extracting named entities with the
> > Standford NER [7]
> > * identifying themes and clustering documents using MALLET [8]
> >
> > Anybody with sets of texts can benefit from this workshop. Any corpus
> > of textual content is apropos: journal articles, books, the complete run
> of
> > a magazine, blog postings, Tweets, press releases, conference
> proceedings,
> > websites, poetry, etc. This workshop is computer (Windows, Linux,
> > Macintosh) agnostic. All the software used in this workshop is freely
> > available on the 'Net, or it is already installed on one's computer.
> Active
> > participation requires zero programming, but students must bring their
> own
> > computer, and they must not be afraid of their computer's command line
> > interface.
> >
> > This workshop will not make participants an expert in natural
> language
> > processing, but it will empower them to make better sense of large sets
> of
> > textual information.
> >
> > [1] Tika - http://tika.apache.org
> > [2] Voyant - http://voyant-tools.org
> > [3] AntConc - http://www.laurenceanthony.net/software/antconc/
> > [4] OpenRefine - http://openrefine.org
> > [5] Tableau Public - https://public.tableau.com/
> > [6] POS Tagger - https://nlp.stanford.edu/software/tagger.shtml
> > [7] NER - https://nlp.stanford.edu/software/CRF-NER.shtml
> > [8] MALLET - http://mallet.cs.umass.edu
> >
> >
> >
>
--
Brad Coffield, MLIS
Assistant Information and Web Services Librarian
Saint Francis University
814-472-3315
[log in to unmask]
|