Skip to content

Conversation

@tindzk
Copy link
Member

@tindzk tindzk commented Aug 1, 2019

In the process of implementing the reverse direction, several limitations were addressed in the rules. A lower error rate was observed on a small Wikipedia text corpus.

Notable changes include the removal of the vertical bar (|) for precedence. Instead, the bi-gram "kh" was introduced. Capital letters are now handled better. In Russian, ъ was mapped to ` since double quotes are commonly used in Slavic texts.

Closes #2.

@tindzk tindzk changed the title Russian: Implement Cyrillic to Latin conversion Implement Cyrillic to Latin conversion Aug 5, 2019
In the process of implementing the reverse direction, several limitations were addressed in the rules. A lower error rate was observed on a small Wikipedia text corpus.

Notable changes include the removal of the vertical bar (|) for precedence. Instead, the bi-gram "kh" was introduced. Capital letters are now handled better. In Russian, ъ was mapped to ` since double quotes are commonly used in Slavic texts.

Closes #2.
@tindzk tindzk merged commit e84c03c into master Aug 5, 2019
@tindzk tindzk deleted the feat/to-latin branch August 5, 2019 07:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Implement Cyrillic to Latin

2 participants