Skip to content

TResNet models #122

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Apr 13, 2020
Merged

TResNet models #122

merged 3 commits into from
Apr 13, 2020

Conversation

mrT23
Copy link
Contributor

@mrT23 mrT23 commented Apr 12, 2020

add 3 TResNet models:
tresnet_m: 80.8% on input resolution 224, inference speed faster than resnet50
tresnet_l: 81.5% on input resolution 224
tresnet_xl: 82.0% on input resolution 224

the models need the InPlaceABN package.

@rwightman
Copy link
Collaborator

@mrT23 thanks for the PR. Does the model work with the train script in this repo? You might need to eat a few extra args like global pool selection. Could you check, you don't need to hook them up, but I would like training to run without crash. I am not at computer for a few days now otherwise would check. Thx

@rwightman
Copy link
Collaborator

Nevermind, you don't pass the kwargs through. I'll merge.

@rwightman rwightman merged commit ebf82b8 into huggingface:master Apr 13, 2020
@mrT23
Copy link
Contributor Author

mrT23 commented Apr 13, 2020

just for the record: :-)

  • i ran validate.py on all models and made sure i reproduce results
  • i ran train.py and saw that training is running.
    with JSD and some lr fine-tuning, we should surpass the 80.8% for tresnet_m, but it is to be proven

thanks

guoriyue pushed a commit to guoriyue/pytorch-image-models that referenced this pull request May 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants