This repository is devoted to the course of Natural Language Processing for M.Sc. in Computer Science and Engineering at Politecnico di Milano, A.Y. 2023/2024
.
├── Checkpoints
│ ├── falcon-rw-1b_fine_tuning_2024_05_20_12_32_35
│ ├── gpt-neo-125m_fine_tuning_2024_05_21_16_54_18
│ ├── phi-1_5_fine_tuning_2024_05_22_21_00_38
├── WorldEmbedders.ipynb
├── deepspeed_config.json
├── README.md
Checkpointsfolder containing checkpoints of fine-tuned model trainingWorldEmbedders.ipynbmain notebook containing the project's codedeepspeed_config.jsonJSON file containing DeepSpeed configuration
The aim of the assignment is to apply the techniques we have learnt in class to analyse a text dataset.
OpenOrca-SlimOrca
- Website: https://huggingface.co/datasets/Open-Orca/SlimOrca
- Paper: https://arxiv.org/abs/2306.02707
- Description: Recorded interactions between a user and the chatbot ChatGPT, that can be used to train a model to act like ChatGPT.
- Task: Fine-tune a chatbot to mimic ChatGPT.
| Name | GitHub | |
|---|---|---|
| Davide Gesualdi | davide.gesualdi@mail.polimi.it | @dav-G |
| Luca Lain | luca.lain@mail.polimi.it | @lucalain |
| Alessandro Mosconi | alessandro2.mosconi@mail.polimi.it | @Alessandro-Mosconi |