-
Couldn't load subscription status.
- Fork 1.2k
[hf] HF PT Training DLCs #5301
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
[hf] HF PT Training DLCs #5301
Conversation
|
Make sure to fix the unit tests |
|
I made some changes to add py312, can I get a retrigger? @mollyheamazon thanks for your help along the way |
: branch 'hf-pt-tr' of https://github.com/pagezyhf/sagemaker-python-sdk into hf-pt-tr
|
I'd need some help on this one as I am changing the test pipeline. Could somebody make sure TORCH_DISTRIBUTED_GPU_SUPPORTED_FRAMEWORK_VERSIONS is up to date?
|
|
@arjkesh maybe you can help me on this one: is torch 2.8 supported on a ml.g5.4xlarge? |
Adding
https://github.com/aws/deep-learning-containers/releases/tag/v1.0-hf-4.55.0-pt-2.7.1-tr-gpu-py312
and
https://github.com/aws/deep-learning-containers/releases/tag/v1.0-hf-4.56.2-pt-2.8.0-tr-gpu-py312
to the image_uris