Introduction to Information Extraction

Series: Natural Language Processing for All

Image
graphic

When

3 – 4 p.m., Oct. 3, 2024

Join us for an introductory session on Information Extraction (IE)! Designed with a focus on automatic extraction of structured information from unstructured text, we will explore why information extraction is a key skill for a variety of research tasks.

IE is a critical component of many NLP applications, from data mining to knowledge graph construction. In this workshop, we will cover from fundamentals of information extraction, such as named entity recognition, relationship extraction, and event detection. We will look at various algorithms and tools used in IE. This workshop will provide hands-on experience with a simple project that demonstrates how to extract valuable insights from large text corpora, implemented using Python.

Enhance your abilities to automate information extraction, to transform raw text into meaningful data!

Join us for an engaging and accessible introduction to Natural Language Processing (NLP) and its practical applications for everyday tasks! In "NLP for All," we will explore the fundamental concepts behind NLP: From understanding how computers interpret human language; to discovering how to improve search queries, use regular expressions, find datasets, and learn about pipelines for working with language. Whether you're curious about chatbots, voice assistants, or automated text transcription and analysis, this series will demystify popular technologies and show you how they work.

What We Will Cover:

  • Foundations of NLP: Gain a solid grasp of NLP concepts and terminology without needing a technical background.
  • Real-World Applications: Explore practical uses of NLP in various contexts, such as improving search and information retrieval, generating and evaluating automatic transcriptions, and working with popular libraries such as spaCy, PyTorch and scikit-learn.
  • Hands-On Experience: We will illustrate NLP concepts in action with a well-documented code notebook, aimed at solving practical examples. We will also explore online sources for NLP tools and datasets, such as HuggingFace.
Pre-requisites:

SERIES: Natural Language Processing for All

Add the Series to your calendar. Add to calendar 

When:  Thursdays, 3-4 pm, Sept. 5 - Oct. 24, 2024
Where: Weaver Science-Engineering Library, Rm 212 and on  Zoom
Instructors: Megh Krishnaswamy
YouTube: Links will be added after the workshops have been presented

Workshop Materials

Contacts