Swedish name: Stora språkmodeller (LLM:er) inom datahantering
This syllabus is valid: 2024-05-27 valid to 2025-05-11 (newer version of the syllabus exists)
Syllabus for courses starting after 2025-05-12
Syllabus for courses starting before 2025-05-11
Course code: 5DV242
Credit points: 7.5
Education level: First cycle
Main Field of Study and progress level:
Computing Science: First cycle, has less than 60 credits in first-cycle course/s as entry requirements
Grading scale: Pass with distinction, Pass, Fail
Responsible department: Department of Computing Science
Established by: Faculty Board of Science and Technology, 2024-02-14
This course investigates how large language models (LLMs) might contribute to solving long standing problems in data management. Specifically it addresses three related questions:
Since these questions can only be addressed after understanding LLMs and basic data management, the first half of the course is devoted to covering these concepts. This starts with feed forward neural networks, RNNs, LSTMs and Seq2Seq models. Then we cover Transformers, BERT and GPT. After this we quickly review the main concepts in data management including conceptual modelling via entity relationship diagrams (ERDs), basic SQL and typical architectures. After covering basic concepts, the second half of the course turns to the three questions above of how LLMs might address long standing data management problems. This includes direct few shot approaches, approaches based on LangChain and vector databases and finally retrieval-augmented generation (RAG) approaches. Additional approaches may also be covered.
Knowledge and understanding
After completing the course, the student should be able to:
Competence and skills
After completing the course, the student should be able to:
Judgement and approach
After completing the course, the student should be able to:
At least 30 ECTS in Computing Science or Mathematics including completed courses in programming (ideally Python), data structures and algorithms, databases, calculus, and linear algebra.
This course is a distance course which requires no local physical presence. Lectures are held over Zoom and course material is delivered via Canvas. Lectures are recorded, so attendance is flexible. Students answer problems and demonstrate their programming solutions in recitation sessions held over Zoom. While attendance in recitations is mandatory, there are multiple opportunities to accommodate student schedules. The course ends with a take home exam that students upload to Canvas. All work is conducted by the student individually.
Students must show their work at four separate Zoom-based recitations. There is also a take home final exam which students upload to Canvas for grading. The grade scale is Pass with distinction (VG), Pass (G), or Fail (U).
Adapted examination
The examiner can decide to deviate from the specified forms of examination. Individual adaptation of the examination shall be considered based on the needs of the student. The examination is adapted within the constraints of the expected learning outcomes. A student that needs adapted examination shall no later than 10 days before the examination request adaptation from the Department of Computing Science. The examiner makes a decision of adapted examination and the student is notified.
If the syllabus has expired or the course has been discontinued, a student who at some point registered for the course is guaranteed at least three examinations (including the regular examination) according to this syllabus for a maximum period of two years from the syllabus expiring or the course being discontinued.
Deep learning
Goodfellow Ian, Bengio Yoshua, Courville Aaron
Cambridge, MA : MIT Press : [2016] : xxii, 775 pages :
ISBN: 9780262035613
Mandatory
Search the University Library catalogue
Reading instructions: https://www.deeplearningbook.org/
In addition, a number of scientific articles will be used.