Home | News & Events | Events | 2015 Events | NISO Webinars | November 18: Text Mining: Digging Deep for Knowledge

NISO Webinar: Text Mining: Digging Deep for Knowledge

Wednesday, November 18, 2015
1:00 p.m. - 2:30 p.m. (Eastern Time)

System Requirements: 

  • NISO has developed a quick tutorial, How to Participate in a NISO Web Event. Please view the recording, which is an overview of the web conferencing system and will help to answer the most commonly asked questions regarding participating in an online Webex event.
  • You will need a computer for the presentation and Q&A.
  • Audio is available through the computer (broadcast) and by telephone. We recommend you have a set-up for telephone audio as back-up even if you plan to use the broadcast audio as the voice over Internet isn't always 100% reliable.
  • Please check your system in advance to make sure it meets the Cisco WebEx requirements. It is your responsibility to ensure that your system is properly set up before each webinar begins.


About the Webinar

With the digital revolution, the ability to search vast amounts of information for specific bits of data has increased exponentially with more and more previously hard-copy only books and information being digitized and made available online. There are many organizations working to digitize content for the benefit of researchers and others. For example, HathiTrust is a partnership of organizations that offers digitized information from libraries all over the world.  Data mining partnerships between university libraries and vendors will hope to bring millions of books and periodicals to the fingertips of researchers. 

In this webinar, presenters will talk about the benefits and challenges to text mining and its impact on the library and information community.


Todd Carpenter, Executive Director, NISO


* * * * * * * 


Text Mining and the Research Library: The Humanities and Beyond
Bernie Reilly, President, Center for Research Libraries (CRL)

In his presentation Bernard Reilly will:

  • Survey text mining practice in various fields of research, including the humanities, public policy, economics, and linguistics
  • Briefly discuss the relationship between text mining, data visualization, and artificial intelligence
  • Examine the issues that arise for libraries in securing the ability and right to text mine from commercial database publishers.  

Bernard F. Reilly is president of the Center for Research Libraries, an international partnership of more than 200 university, college and independent research libraries.  Reilly directs CRL’s efforts to ensure the long-term integrity and accessibility of documentation and evidence -- tangible and digital -- critical to original research in the humanities, sciences and social sciences. Prior to his time at CRL, Reilly was Director of Research and Access at the Chicago History Museum (1997-2001), where he directed the administration, digitization and dissemination of the CHM library, archives, and architecture, audio, television, and pictorial collections.  Between 1987 and 1997 Reilly was Head of the Curatorial Section in the Prints and Photographs Division of the Library of Congress, which provided curatorial and policy support for the early development of the National Digital Library. 

* * * * * * * 


Enriching the Social Sciences Through Text Mining
Alan Maloney, Senior Product Analyst, Sage Publications

Text mining and semantic technologies are commonplace in scientific, technical and medical circles. Text mining in the social sciences is more experimental, arguably more exciting, and becoming increasingly important to the organization and discovery of scholarship.

This is why increasing numbers of publishers are mining their own content to add structure, metadata and insight. This talk will look at how we mine our own content at SAGE to improve the researcher experience, covering questions such as:

  • How do  you teach a computer to read social science content?
  • What are the challenges presented by a highly context-sensitive area like the social sciences?
  • How do you structure information that is meaningless without its original context?
  • What's the role for human indexers in a world where computers are getting better at understanding meaning?

Alan Maloney is Senior Product Analyst in the London office of SAGE Publications, working on how content is organized across SAGE's online products. Having started on the editorial team, he's always been close to the work that SAGE publishes and enjoys bringing a technological perspective on content, and vice versa, in order to make content more discoverable, findable and usable.

* * * * * * * 

The HathiTrust Research Center: Enabling New Knowledge Through Shared Infrastructure
Robert McDonald - HathiTrust Research Center Executive committee member; Associate Dean for Library Technologies, Indiana University

The HathiTrust Research Center (HTRC) enables computational access for nonprofit and educational users to published works in the public domain and, in the future, on limited terms to works in-copyright from the HathiTrust. The HathiTrust digital library, containing 14 million volumes spnanning various times and locations, along with HTRC's non-consumptive tools and services, provide scholars a unique opportunity of answering their research qeustions based upon this rich resource. One most prominent use of HathiTrust volumes is its support on text mining on this large-scale corpus. In this talk, we will briefly present an overview of HTRC, its tools, resources, and services.

*The HTRC is a collaborative research center launched jointly by Indiana University and the University of Illinois, along with the HathiTrust Digital Library, to help meet the technical challenges of dealing with massive amounts of digital text that researchers face by developing cutting-edge software tools and cyberinfrastructure to enable advanced computational access to the growing digital record of human knowledge.


SAVE! Register for multiple events.

If paying by credit card, register online.

If paying by check, please use this PDF form.

Registration closes on November 18, 2015 at 12:00 p.m. (ET)

Registration Costs 

  • NISO Member
    • $95.00 (US and Canada)
    • $109.00 (International)
  • NASIG Member
    • $95.00
  • Non-Member
    • $125.00 (US and Canada)
    • $149.00 (International)
  • Student
    • $49.00

Additional Information

  • Registration closes at 12:00 p.m. (ET) on November 18, 2015. Cancellations made by November 11, 2015 will receive a refund, less a $25 cancellation. After that date, there are no refunds.
  • Registrants will receive detailed instructions about accessing the webinar via e-mail the Monday prior to the event. (Anyone registering between Monday and the close of registration will receive the message shortly after the registration is received, within normal business hours.) Due to the widespread use of spam blockers, filters, out of office messages, etc., it is your responsibility to contact the NISO office if you do not receive login instructions.
  • If you have not received your Login Instruction email by 10:00 a.m. (ET) on the Tuesday before the webinar, at please contact the NISO office at nisohq@niso.org for immediate assistance.
  • Registration is per site (access for one computer) and includes access to the online recorded archive of the webinar. You may have as many people as you like from the registrant's organization view the webinar from that one connection. If you need additional connections, you will need to enter a separate registration for each connection needed.
  • If you are registering someone else from your organization, either use that person's e-mail address when registering or contact nisohq@niso.org to provide alternate contact information.
  • Library Standards Alliance (LSA) members receive one free webinar connection as part of their membership and DO NOT need to register for the event for this free connection. Your webinar contact will receive the login instructions the Monday before the event. You may have as many people as you like from the member's library view the webinar from that one connection. If you need additional connections beyond the free one, then you will need to enter a paid registration (at the member rate) for each additional connection required.
  • Webinar presentation slides and Q&A will be posted to the site following the live webinar.
  • Registrants and LSA member webinar contacts will receive an e-mail message containing access information to the archived webinar recording within 48 hours after the event. This recording access is only to be used by the registrant's or member's organization.