Sarah
You might be interested in Constellate’s Skill-Build program. All participants will complete a text classification project using OpenAI’s API by the end of the semester. UIUC is trialing Constellate this semester, so anyone at UIUC may participate. We’ve already run the intro Python classes (though you are welcome to catch up with the recordings). The LLM classes run in October and November.
https://constellate.org/skill-build
AMY
________________________________
From: Code for Libraries <[log in to unmask]> on behalf of EDWIN VINCENT SPERR <[log in to unmask]>
Sent: Monday, September 30, 2024 9:23:02 AM
To: [log in to unmask] <[log in to unmask]>
Subject: Re: [CODE4LIB] CODE4LIB Digest - 28 Sep 2024 to 29 Sep 2024 (#2024-202)
Sarah --
Were y'all looking to do some sort of broad bibliometric study along the lines of "Particle physics has 20% fewer 'Review' articles than structural engineering"?
If so, I'd be tempted to pull out a random sample from each discipline and classify them manually...
(Of course, that's easy for me to say as the person who won't be trawling through hundreds of records)
Otherwise, as Rodrigo suggests, you're gonna be building a classifier...
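For what it's worth, the sampling route could be sketched roughly like this in Python. This is only a minimal sketch under assumptions: it supposes each record is a dict with hypothetical `discipline` and `citation` keys, and the function names `stratified_sample` and `type_shares` are made up for illustration. The manual classification itself still has to be done by a person.

```python
import random
from collections import Counter, defaultdict


def stratified_sample(records, per_discipline, seed=0):
    """Draw up to `per_discipline` records at random from each discipline.

    `records` is assumed to be a list of dicts with a "discipline" key
    (a hypothetical layout, not taken from any particular database export).
    """
    rng = random.Random(seed)  # fixed seed so the sample can be reproduced
    by_discipline = defaultdict(list)
    for rec in records:
        by_discipline[rec["discipline"]].append(rec)

    sample = {}
    for discipline in sorted(by_discipline):
        recs = by_discipline[discipline]
        k = min(per_discipline, len(recs))  # small disciplines are taken whole
        sample[discipline] = rng.sample(recs, k)
    return sample


def type_shares(labels):
    """Turn a list of manually assigned type labels into proportions."""
    total = len(labels)
    if total == 0:
        return {}
    return {label: n / total for label, n in Counter(labels).items()}


if __name__ == "__main__":
    # Made-up records, purely to show the shape of the input.
    records = [
        {"discipline": "physics", "citation": f"Physics ref {i}"} for i in range(50)
    ] + [
        {"discipline": "engineering", "citation": f"Engineering ref {i}"} for i in range(30)
    ]
    picked = stratified_sample(records, per_discipline=10)
    for discipline, recs in picked.items():
        print(discipline, len(recs))
```

Once each sampled citation has been labeled by hand, passing the list of labels for one discipline to `type_shares` gives the proportions to compare across disciplines.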
Ed Sperr, MLIS
Systems and Discovery Librarian
[log in to unmask]
University of Georgia Libraries, Main Library
Athens, GA 30602-1641
Date: Sun, 29 Sep 2024 22:25:58 +0000
From: "Park, Sarah" <[log in to unmask]>
Subject: identifying publication types from citations
Hi,
I am looking for a tool or method that can help us identify publication types from citations/references using scripts or AI-based tools. My colleague and I are interested in citation analysis to determine the types of sources used in a discipline, for example, journal articles, review articles, magazine articles, book chapters, books, websites, government documents (Gov Docs), and NGO documents.
One method I have found so far is to use article database APIs, such as Scopus, to identify document types, but Scopus seems to track only some of these types. I have also heard that a model can be trained using ChatGPT or other generative AI, but I haven't heard how effective that is.
Any thoughts or suggestions that could lead to a possible solution would be greatly appreciated!
Best,
Sarah G. Park, she/her
Mathematics and Computational Sciences Librarian
Head, Mathematics Library
Assistant Professor
University of Illinois at Urbana-Champaign
[log in to unmask]