- Optimizations and improvements based on Apple's project "ML-ANE-Transformers".
- The adaptation of the GPT-2 model in ANE will be carried out later in the project.
- In the original official example I made a slight modification to make it a model of GPT-2 architecture, and can be optimized and accelerated on the ANE engine.
# clone this repo
>> git clone https://github.com/StarxSky/ANE-GPT-New
#First you need to install the required packages
>> cd ANE-GPT-New-main
# Strat Install
>> pip install -r requirements.txt
# run 'Train.py'
>> python Train.py
- This repository uses Apple's original license,Apple reserves the rights to the original code