LISTSERV mailing list manager LISTSERV 16.5

Help for CODE4LIB Archives


CODE4LIB Archives

CODE4LIB Archives


CODE4LIB@LISTS.CLIR.ORG


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

CODE4LIB Home

CODE4LIB Home

CODE4LIB  June 2020

CODE4LIB June 2020

Subject:

2nd CfP SDP@EMNLP 2020: 1st Workshop on Scholarly Document Processing and Shared Tasks (SDP 2020) | Online event announcement & new submission dates *Call for papers*

From:

Eric Lease Morgan <[log in to unmask]>

Reply-To:

Code for Libraries <[log in to unmask]>

Date:

Thu, 4 Jun 2020 08:18:05 -0400

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (317 lines)

[Forwarded upon request. --ELM]


Dear colleagues,

You are invited to participate in the 1st Workshop on Scholarly Document
Processing (SDP 2020) to be held in conjunction with the 2020 Conference on
Empirical Methods in Natural Language Processing (EMNLP 2020) on November
19. The workshop will be held VIRTUALLY with EMNLP 2020.

*Important updates:*


   - The workshop will be held virtually on November 19. Details about mode
   of participation will be released closer to the workshop.
   - The new submission deadline for research papers is August 15, 2020.
   - The new deadline for the shared tasks system runs is August 15, 2020.
   - All three shared tasks are still open for participant registration:
   https://ornlcda.github.io/SDProc/sharedtasks.html#register
   - We are delighted to announce that the workshop will feature two
   keynote speakers:
      - Kuansan Wang, Managing Director, MSR Outreach Academic Services
      - Steinn Sigurðsson, Scientific Director of arXiv, Professor in the
      Department of Astronomy & Astrophysics at The Pennsylvania State
University

*About the workshop:*

The SDP 2020 workshop will consist of a research track and three shared
tasks.

The shared tasks include the 6th edition of the CL-SciSumm shared task (
https://github.com/WING-NUS/scisumm-corpus/) and two new summarization
tasks -- CL-LaySumm and LongSumm -- geared towards easier access to
scientific methods and results.

SDP is led by organizers of BIRNDL (https://philippmayr.github.io/BIRNDL-WS/)
and WOSP (https://wosp.core.ac.uk/) workshop series.

Details about mode of participation will be announced later on our website
and Twitter.
Website: https://ornlcda.github.io/SDProc/
Twitter: https://twitter.com/sdproc


* Detailed call for papers:*

*** Introduction ** *


In addition to the long-standing challenge faced by scholars of keeping up
with the growing literature in their own and related fields, they must now
compete with malign pseudo-science and dis-information in informing public
policy and behavior. This has stimulated workshops and research focused on
enhancing search, retrieval, summarization, and analysis of scholarly
documents. However, the general research community on scholarly document
processing remains fragmented, and efforts towards natural language
understanding of scholarly text that is central to vastly improve all the
said downstream applications are not widespread.

To address these gaps, we propose the first Workshop on Scholarly Document
Processing.
We seek to reach to the broader NLP and AI/ML community to pool the
distributed efforts to improve scholarly document understanding and enable
intelligent access to the published research. The goal of SDP is two-fold:
to increase collaboration between communities interested in leveraging
knowledge stored in scholarly literature and data, and to establish SDP as
the single-focused primary venue for the field.


We seek to appeal to the mainstream NLP and ML community working on SDP
tasks – which are NLP tasks – to publish at SDP as we seek to establish SDP
as the integrated premier venue. We have established a steering committee <
https://ornlcda.github.io/SDProc/steeringcommittee.html> to help us turn
SDP into a conference in the forthcoming years.


* ** Topics of Interest ***

We invite submissions from all communities interested in natural language
processing, information retrieval, and data mining problems in scholarly
documents; and in processing scholarly documents for easier access to
various audiences. The topics of interest include, but are not limited to:

   - Information extraction, text mining and parsing scholarly literature
   - Reproducibility and peer review
   - Lay summarization (i.e., summaries created for non-experts) of
   individual and collections of scholarly documents
   - Discourse modeling and argument mining
   - Summarization and question-answering for scholarly documents
   - Semantic and network-based indexing, search and navigation in
   structured text
   - Graph analysis/mining including citation and co-authorship networks
   - Analysing and mining of citation contexts for document understanding
   and retrieval
   - New scholarly language resources and evaluation
   - Connecting and interlinking publications, data, tweets, blogs or their
   parts
   - Disambiguation, metadata extraction, enrichment, and data quality
   assurance for scholarly documents
   - Bibliometrics, scientometrics, and altmetrics approaches and
   applications
   - Other aspects of scholarly workflows including open access/science,
   and research assessment
   - Infrastructures for accessing scholarly publications and/or research
   data

*** The 6th Computational Linguistics Scientific Document Summarization
Shared Task (CL-SciSumm 2020) ** *
(Organisers: Muthu Kumar Chandrasekaran)

CL-SciSumm is the first medium-scale shared task on scientific document
summarization, with over 500 annotated documents. Last year's CL-SciSumm
shared task introduced large scale training datasets, both annotated from
ScisummNet and auto-annotated. For the task, Systems were provided with a
Reference Paper (RP) and 10 or more Citing Papers (CPs) that all contain
citations to the RP, which they used to summarise RP. This was evaluated
against abstract and human written summaries on ROUGE.


The task is defined as follows:

*Given*: A topic consisting of a Reference Paper (RP) and Citing Papers
(CPs) that all contain citations to the RP. In each CP, the text spans
(i.e., citances) have been identified that pertain to a particular citation
to the RP.

*Task 1A*: For each citance, identify the spans of text (cited text spans)
in the RP that most accurately reflect the citance. These are of the
granularity of a sentence fragment, a full sentence, or several consecutive
sentences (no more than 5).

*Task 1B*: For each cited text span, identify what facet of the paper it
belongs to, from a predefined set of facets.

*Task 2 (optional bonus task)*: Finally, generate a structured summary of
the RP from the cited text spans of the RP. The length of the summary
should not exceed 250 words.

This year, CL-SciSumm '20 will have two new tracks: *LaySumm* and *LongSumm*
.

*** CL-LaySumm 2020: The 1st Computational Linguistics Lay Summary
Challenge Shared Task ** *
(Organisers: Anita De Waard, Ed Hovy)

To ensure and increase the relevance of science for all of society and not
just a small group of niche practitioners, researchers have been
increasingly tasked by funders and publishers to outline the scope of their
research for the general public by writing a summary for a lay audience, or
lay summary. The LaySumm summarization task considers automating this
responsibility, by enabling systems to automatically generate lay
summaries. A lay summary explains, succinctly and without using technical
jargon, what the overall scope, goal and potential impact of a scholarly
paper is.

The corpus for this task will comprise full-text papers with lay summaries,
in a variety of domains, and from a number of journals. Elsevier will make
available a collection of lay summaries from a multidisciplinary collection
of journals, as well as the abstracts and full text of these journals.

The task is defined as follows:

*Given*: A full-text paper, its abstract, and a lay summary of a given paper

*Task*: For each paper, generate a lay summary of the specified length


*Evaluation*

The Lay Summary Task will be scored by using several ROUGE metrics to
compare the system output and the gold standard lay summary. As a follow-up
to the intrinsic evaluation, we will crowdsource a number of automatically
generated lay summaries to a panel of judges and a lay audience. Details of
the crowdsourcing evaluation will be announced with the sharing of the
final test corpus on July 1st.

All nominated entries will be invited to publish a paper in Open Access
(Author-Payment Charges will be waived) in a selected Elsevier publication.
Authors will be asked to provide an automatically generated lay summary of
their paper, together with their contribution.


*** LongSumm 2020: Shared Task on Generating Long Summaries for Scientific
Documents ** *
(Organisers: Michal Shmueli-Scheuer, Guy Feigenblat)

Most of the work on scientific document summarization focuses on generating
relatively short summaries (250 words or less). While such a length
constraint can be sufficient for summarizing news articles, it is far from
sufficient for summarizing scientific work. In fact, such a short summary
resembles more to an abstract than to a summary that aims to cover all the
salient information conveyed in a given text. Writing such summaries
requires expertise and a deep understanding in a scientific domain, as can
be found in some researchers’ blogs.

The LongSumm task opted to leverage blogs created by researchers in the NLP
and Machine learning communities and use these summaries as reference
summaries to compare the submissions against.

The corpus for this task includes a training set that consists of 1705
extractive summaries and around 700 abstractive summaries of NLP and
Machine Learning scientific papers. These are drawn from papers based on
video talks from associated conferences (Lev et al. 2019 TalkSumm) and from
blogs created by NLP and ML researchers. In addition, we create a test set
of abstractive summaries. Each submission is judged against one reference
summary (gold summary) on ROUGE and should not exceed 600 words.


*** Submission Information ** *

Authors are invited to submit full and short papers with unpublished,
original work. Submissions will be subject to a double-blind peer review
process. Accepted papers will be presented by the authors at the workshop
either as a talk or a poster. All accepted papers will be published in the
workshop proceedings.

*Submission Website*: Submission is electronic, using the Softconf START
conference management system: https://www.softconf.com/emnlp2020/sdp2020/

The submissions should be in PDF format and anonymized for review. All
submissions must be written in English and follow the EMNLP 2020 formatting
requirements: https://2020.emnlp.org/call-for-papers.
*Long paper submissions*: up to 8 pages of content, plus unlimited
references.
*Short paper submissions*: up to 4 pages of content, plus unlimited
references.
Final versions of accepted papers will be allowed 1 additional page of
content so that reviewer comments can be taken into account.

Shared Task registration: Participants of all shared tasks need to register
here:
https://docs.google.com/forms/d/e/1FAIpQLScfHzByrog-k299qBuCp3SbPWcb905_kmOWMvHpDH57VLpVrg/viewform.



* ** Important Dates ***

*Research track*:
Submission deadline – August15, 2020

Notification of Acceptance – September 29, 2020
Camera-ready submission due – October 10, 2020
Workshop – November 19, 2020

*Shared task track*:
Training set release – Feb 15, 2020
Deadline for registration – April 30, 2020 (remains open till
evaluation window starts)
Test set release (Blind) – July 1, 2020
System runs due – August 1, 2020
Preliminary system reports due – August 15, 2020
Camera-ready submission due – September 29, 2020
Workshop – November 19, 2020

*** SDP 2020 Keynote Speakers ***
SDP keynotes are invited by the organizing committee and will present in
the research track of the workshop.

Kuansan Wang, Managing Director, Microsoft Research Outreach Academic
Services
Steinn Sigurdsson, Scientific Director of arXiv and Professor at the
Pennsylvania State University

* ** SDP 2020 Journal Extension ***
In the past, the accepted authors were invited to submit an extended
version of their work to a special issue of a selected journal. The
organizers are currently in the process of identifying appropriate journals
to host a similar special issue this year. Relevant updates including
topics and requirements for this special issue will be shared on the
workshop website in due time.


*** Organizing Committee ***
Muthu Kumar Chandrasekaran, Amazon, Seattle, USA
Anita de Waard, Elsevier, USA
Guy Feigenblat, IBM Research AI, Haifa Research Lab, Israel
Dayne Freitag, SRI International, San Diego, USA
Tirthankar Ghosal, Indian Institute of Technology Patna, India
Drahomira Herrmannova, Oak Ridge National Laboratory, USA
Eduard Hovy, Research Professor, LTI, Carnegie Mellon University, USA
Petr Knoth, Open University, UK
David Konopnicki, IBM Research AI, Haifa Research Lab, Israel
Philipp Mayr, GESIS – Leibniz Institute for the Social Sciences, Germany
Robert M. Patton, Oak Ridge National Laboratory, USA
Michal Shmueli-Scheuer, IBM Research AI, Haifa Research Lab, Israel
Dominika Tkaczyk, Crossref, UK


*** Steering Committee ***
C. Lee Giles, David Reese Professor, College of Information Sciences and
Technology, Pennsylvania State University
Min-Yen Kan, Associate Professor, School of Computing, National University
of Singapore
Dragomir Radev, A. Bartlett Giamatti Professor of Computer Science, Yale
University
Jie Tang, Professor and Associate Chair of the Department of Computer
Science and Technology, Tsinghua University
Alex Wade, Group Technical Program Manager, Chan Zuckerberg Initiative
Kuansan Wang, Managing Director, Microsoft Research Outreach Academic
Services
Bonnie Webber, Professor, School of Informatics, University of Edinburgh

*** Programme Committee ***
Please visit our website for the complete list of PCs:
https://ornlcda.github.io/SDProc/programcommittee.html
More details available on the workshop website: <http://goog_1307099532>
https://ornlcda.github.io/SDProc/


With kind regards,
SDP 2020 organizing committee


--
Min-Yen KAN (Dr) :: Associate Professor :: National University of Singapore :: NUS School of Computing, AS6 05-12, 13 Computing Drive
Singapore 117417 :: +65 6516 1885(DID) :: +65 6779 4580 (Fax) :: [log in to unmask] (E) :: www.comp.nus.edu.sg/~kanmy (W)

Top of Message | Previous Page | Permalink

Advanced Options


Options

Log In

Log In

Get Password

Get Password


Search Archives

Search Archives


Subscribe or Unsubscribe

Subscribe or Unsubscribe


Archives

March 2024
February 2024
January 2024
December 2023
November 2023
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
December 2020
November 2020
October 2020
September 2020
August 2020
July 2020
June 2020
May 2020
April 2020
March 2020
February 2020
January 2020
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006
January 2006
December 2005
November 2005
October 2005
September 2005
August 2005
July 2005
June 2005
May 2005
April 2005
March 2005
February 2005
January 2005
December 2004
November 2004
October 2004
September 2004
August 2004
July 2004
June 2004
May 2004
April 2004
March 2004
February 2004
January 2004
December 2003
November 2003

ATOM RSS1 RSS2



LISTS.CLIR.ORG

CataList Email List Search Powered by the LISTSERV Email List Manager