Seems to be caused by: https://github.com/PanQiWei/AutoGPTQ/issues/373 For us to make this work, we have to downgrade to 0.3.2 or compile from source and redistribute ourselves.