skrywer

avatar

Profiel van pmindia

Geskep by 2021.02.03
Gemaak deur Administratorus
lisensie: Unknown

Licence ------- The corpus is released under the CC-BY-4.0, in other words the corpus can be freely shared and adapted as long as appropriate credit is give. https://creativecommons.org/licenses/by/4.0/ Code ---- The code for crawling and aligning is available at https://github.com/bhaddow/pmindia-crawler Citation -------- If you use the corpus, please cite: 
@ARTICLE{2020arXiv200109907H,
 author = {{Haddow}, Barry and {Kirefu}, Faheem},
 title = "{PMIndia -- A Collection of Parallel Corpora of Languages of India}",
 journal = {arXiv e-prints},
 keywords = {Computer Science - Computation and Language},
 year = "2020",
 month = "Jan",
 eid = {arXiv:2001.09907},
 pages = {arXiv:2001.09907},
archivePrefix = {arXiv},
 eprint = {2001.09907}
}
 Acknowledgements ---------------- This work has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825299 (Gourmet). We thank the Prime Minister's Office of the Government of India for making the content available for re-distribution. Contact ------- Barry Haddow (bhaddow at inf dot ed dot ac dot uk)