PhD Course

Cutting-Edge Text Mining: from Consolidated Natural Language Processing Methods to Advanced Large Language Models.


  • Teacher: Prof. Gianluca Moro (DISI, University of Bologna)
  • Year of study: first, second or third year
  • Teaching period: November 2024; October 2025
  • Total hours: 12
  • Doctoral credits: 2,4
  • Assessment method: by project
  • Readings: slides and materials

October 2025 edition:

  • 22/10 - 15:00-18:30 - online (link) + room 5.6 (viale Risorgimento 2, Bologna)
  • 27/10 - 15:00-18:30 online (link)
  • 29/10 - 15:00-18:30 - online (link) + room 5.6 (viale Risorgimento 2, Bologna)
  • 31/10 - 16:00-17:30 online (link)

The course presents the fundamentals of text mining to empower students with the skills of knowing and performing the most important downstream tasks in natural language processing. To ensure the content is easily accessible, the course methodically introduces a selection of the most pivotal models, methods, techniques, and algorithms that have shaped the discipline, along with ongoing breakthroughs that have culminated in the development of current large language models (LLM).

The course examines the latest generative model innovations, highlighting their functionalities, limits, applications and the theoretical concepts that could fuel future tech advancements. By the end, students will know both the essential principles of the discipline and advanced efficient techniques to employ and train modern compressed large language models in both practical settings and academic research.

The course will also provide an overview of the most promising current research perspectives in the field, offering insights into the frontier of advancements and emerging trends, from knowledge-enhanced NLP to differentiable reasoning and explainability, including the emergent cognitive capabilities of the new LLMs