Would it be possible to change the text encoder? Maybe integrate a model like T5XXL, so we can write more freely and clarify intentions and nuances? I know it’s not that simple , but working with that kind of model is a game changer!