Wednesday, May 28, 2025

Show HN: Image-to-Image Translation Model https://ift.tt/HsD2M3j

Show HN: Image-to-Image Translation Model We launched a v1 of a image to image translation API which translates the text on an images by replacing the existing text. For v1, it's pretty much a model pipeline: OCR current text -> generate mask -> erase text -> translate text -> use embedding comparison to find similar font -> map text back on image v1 was more like a prototype which already beats many of the similar services provided by Google, Azure, etc We're working on v2 where we're training a diffusion model to translate the text on the image. We've got the pipeline working for English and Chinese, and now we're building datasets for other languages. https://ift.tt/ua74z6K May 29, 2025 at 03:47AM

No comments:

Post a Comment