Skip to content

RiTA-nlp/tweety-ita-resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Tweety Ita Resources

This repository contains scripts and resources to replicate the training of Tweety Italian models.

The src folder contains python and bash script organized into:

  • continual_training: to run a small number of adaptation steps in Italian after the tokenizer swap;
  • alignment: scripts and recipes to run SFT and DPO with HF's alignment-notebook
  • datasets: code to create dataset resources

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published