Skip to content
This repository has been archived by the owner on Mar 1, 2024. It is now read-only.

Support for more languages #10

Open
EmilStenstrom opened this issue May 1, 2020 · 7 comments
Open

Support for more languages #10

EmilStenstrom opened this issue May 1, 2020 · 7 comments
Labels
enhancement New feature or request

Comments

@EmilStenstrom
Copy link

It looks like this architecture would work for non-english languages too. Wikipedia is availiable in more languages, flair has embeddings in other languages, and BERT is available elsewhere.

Is there something stopping this from being applied to eg. Swedish?

@ledw ledw added the enhancement New feature or request label May 1, 2020
@ledw
Copy link
Contributor

ledw commented May 1, 2020

Hi @EmilStenstrom, thanks for your interest. Supporting more languages is WIP and we plan to include that in future versions.

@EmilStenstrom
Copy link
Author

Let me know if there’s something I can do to help! (Native Swedish speaker)

fabiopetroni added a commit that referenced this issue May 18, 2020
@Nuccy90
Copy link

Nuccy90 commented Oct 27, 2020

Hi @EmilStenstrom we meet again!

We are looking into training a Swedish BLINK, but we have noticed there is not much documentation on data preprocessing and training pipelines. Would it be possible for someone to add a step by step guide for training a model for another language? Especially how you go from the wikipedia dumps to training data. @ledw

@SuzanaK SuzanaK mentioned this issue Jun 1, 2021
@Giovani-Merlin
Copy link

I've created a new repository for training bi-encoder models, following this tutorial you can train the model in another language using a correct transformer model using the BLINK code or following this tutorial.
But flair support was not implemented yet.

@JLUGQQ
Copy link

JLUGQQ commented May 2, 2022

I've created a new repository for training bi-encoder models, following this tutorial you can train the model in another language using a correct transformer model using the BLINK code or following this tutorial. But flair support was not implemented yet.

The link is not available now. Can you update it? Thanks.

@alexnotes
Copy link

I've created a new repository for training bi-encoder models, following this tutorial you can train the model in another language using a correct transformer model using the BLINK code or following this tutorial. But flair support was not implemented yet.

Hi buddy, could you update this tutorial link? it's not available. thanks.

@banditelol
Copy link

I've created a new repository for training bi-encoder models, following this tutorial you can train the model in another language using a correct transformer model using the BLINK code or following this tutorial. But flair support was not implemented yet.

Hi buddy, could you update this tutorial link? it's not available. thanks.

there's a tutorial on how to train on smaller biencoder model here #116

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

7 participants