  1. Jan 10, 2013
    • Add words to Portuguese · 84f932be
      Jean Chalard authored
      >>> dictionaries/pt_BR_wordlist.combined.gz
      Header :
        date : 1355802839 <=> 1357790917
        version : 29 <=> 30
      Body :
      Added: à 30
      Added: é 30
      Added: ò 30
      Added: ô 30
      
      >>> dictionaries/pt_PT_wordlist.combined.gz
      Header :
        date : 1355802856 <=> 1357790930
        version : 29 <=> 30
      Body :
      Added: à 30
      Added: é 30
      Added: ò 30
      Added: ô 30
      
      >>> java/res/raw/main_pt_br.dict
      Header :
        date : 1355802839 <=> 1357790917
        version : 29 <=> 30
      Body :
      Added: à 30
      Added: é 30
      Added: ò 30
      Added: ô 30
      
      Bug: 7966948
      Change-Id: I71c0986cf616d67926d0a6a0e53099b04b0427d5
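
The per-file summaries above follow a regular layout: header fields as `old <=> new` pairs, then one `Added: word freq` line per new entry. A minimal Python sketch for reading one such block is given here. It assumes the `date` field is a Unix timestamp in seconds, which the commit message does not state but which is consistent with the commit dates. The names `parse_summary` and `as_utc` are hypothetical helpers, not part of any dictionary tooling.

```python
from datetime import datetime, timezone

# One summary block, copied from the commit message above.
SUMMARY = """\
Header :
  date : 1355802839 <=> 1357790917
  version : 29 <=> 30
Body :
Added: à 30
Added: é 30
Added: ò 30
Added: ô 30
"""


def parse_summary(text):
    """Return (header, added): header maps field -> (old, new),
    added is a list of (word, freq) pairs."""
    header, added = {}, []
    for raw in text.splitlines():
        line = raw.strip()
        if "<=>" in line:
            # e.g. "version : 29 <=> 30"
            key, _, values = line.partition(":")
            old, new = (int(v) for v in values.split("<=>"))
            header[key.strip()] = (old, new)
        elif line.startswith("Added:"):
            # e.g. "Added: à 30"
            word, freq = line[len("Added:"):].split()
            added.append((word, int(freq)))
    return header, added


def as_utc(ts):
    """Assumption: ts is seconds since the Unix epoch."""
    return datetime.fromtimestamp(ts, tz=timezone.utc).isoformat()


header, added = parse_summary(SUMMARY)
old_date, new_date = header["date"]
print(as_utc(old_date), "->", as_utc(new_date))
print(header["version"], added)
```

Under that assumption the two `date` values fall on 2012-12-18 and 2013-01-10 (UTC), matching the dates of the previous dictionary update and of this commit.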
  2. Dec 18, 2012
    • Update dictionaries · 21dbe370
      Jean Chalard authored
      cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl,
      pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match
      the spec. This has no large effect in practice, except that
      the dictionary becomes stronger relative to the spatial model
      (especially in lower-count corpora, like lt, lv, sr).
      en* : Small changes (essentially rounding going the other way).
      ru : the above rescaling, and remove the following words:
      Дре, ОСТа, Планше, легкими, легком, легкому, легкости,
      легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой,
      нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую;
      other accented forms were already in the dictionary.
      
      Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6
  3. Nov 22, 2012
    • Update the dictionaries · 526371f2
      Jean Chalard authored
      This brings the dictionaries to the same level as
      master. The change in master reflected by this
      change is I94fe7f8f.
      
      - XML de dict :
        Update header only.
      - XML en dict :
        Add the not_a_word flag to all whitelist entries.
        Demote "il:92" and "shel:52" to freq 0.
      - XML en_GB dict :
        Add the not_a_word flag to all whitelist entries.
        Add "milk:122" and "practice:143"
      - XML fr dict :
        Add missing words : "Nostradamus:40" "défendais:30"
        "gmail:50" "générale:140" "hm:0" "hmm:0" "y'en:130"
        "l'apocalypse:31" "m'épuise:30" "recontacter:80"
        "t'annonce:30"
        Set freq of non-word shortcuts for digraphs to 1 instead
        of 0, so that they can be entered by gesture.
      - XML ru dict :
        Remove a lot of two-character non-words.
      
      - Binary de dict :
        Remove the obsolete "options" header, and add the "dictionary"
        header.
      - Binary en dict :
        Flag "hoe" "hoes" "il" "shel" as non-words.
        Also drop freq of "il" and "shel" to 0.
        Add the "locale" header that was missing.
      - Binary es dict :
        Add the "dictionary" header.
      - Binary fr dict :
        Add the same words as above. Non-word shortcuts were already
        set to 1.
      - Binary it dict :
        Add a "dictionary" header. Also change freq of
        "Šarapova" from 50 to 37; not sure why it was 50.
      - Binary pt_BR dict :
        Add a "dictionary" header.
      - Binary ru dict :
        Add a "dictionary" header and remove the same words as above.
      
      Bug: 7458565
      Change-Id: I22dd176ed2df99b847047f2d1045f6d861ccb8b0
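
The message above writes entries as `word:freq` and describes two kinds of change: adding words, and demoting an entry to freq 0 while flagging it as not a word. A minimal Python sketch of both operations follows. The in-memory layout (`word -> {"freq", "not_a_word"}`) and the helper names are hypothetical, chosen only for illustration; they are not the real XML or binary dictionary format.

```python
def parse_entry(token):
    """Split a 'word:freq' token on its last colon, so that words
    containing apostrophes (e.g. "y'en:130") are handled."""
    word, freq = token.rsplit(":", 1)
    return word, int(freq)


def add_words(wordlist, tokens):
    """Add each 'word:freq' token as a regular word."""
    for token in tokens:
        word, freq = parse_entry(token)
        wordlist[word] = {"freq": freq, "not_a_word": False}


def demote(wordlist, word):
    """Set freq to 0 and flag the entry as not a word; the entry
    itself stays in the list."""
    wordlist[word]["freq"] = 0
    wordlist[word]["not_a_word"] = True


# Hypothetical starting state, using the values quoted in the message.
wordlist = {}
add_words(wordlist, ["il:92", "shel:52"])

# Additions listed for the en_GB and fr dictionaries.
add_words(wordlist, ["milk:122", "practice:143", "y'en:130", "l'apocalypse:31"])

# Demotions listed for the en dictionary.
for w in ("il", "shel"):
    demote(wordlist, w)

for word, entry in wordlist.items():
    print(word, entry)
```

Splitting on the last colon rather than the first is a defensive choice here: none of the quoted words contain a colon, but the frequency is always the final field.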
  4. Nov 21, 2012
    • Update dictionaries and fix mistakes · d5f53710
      Jean Chalard authored
      - Combined de dict :
        Remove digraph shortcuts that were in by mistake.
      - Combined en dict :
        Set freq of "baton" "batons" "mace" "puff"
        "puffs" and "tasers" to zero. They are offensive
        in en_GB.
      - Combined en_GB dict :
        Change freq of "il" to 0 and flag it "not a word". It stays
        in the dict as a whitelist entry for "I'll"; for some
        reason it had freq 99.
        Add "milk:122" and "practice:143"
      - Combined fr dict :
        Add missing words : "Nostradamus:40" "défendais:30"
        "gmail:50" "générale:140" "hm:0" "hmm:0" "y'en:130"
        "l'apocalypse:31" "m'épuise:30" "recontacter:80"
        "t'annonce:30"
        Set freq of non-word shortcuts for digraphs to 1 instead
        of 0, so that they can be entered by gesture.
      - Combined ru dict :
        Remove a lot of two-character non-words.
      
      - Binary de dict :
        Remove the obsolete "options" header, and add the "dictionary"
        header.
      - Binary en dict :
        Flag "hoe" "hoes" "il" "shel" as non-words.
        Also drop freq of "il" and "shel" to 0.
        Add the "locale" header that was missing.
      - Binary es dict :
        Add the "dictionary" header.
      - Binary fr dict :
        Add the same words as above. Non-word shortcuts were already
        set to 1.
      - Binary it dict :
        Add a "dictionary" header. Also change freq of
        "Šarapova" from 50 to 37; not sure why it was 50.
      - Binary pt_BR dict :
        Add a "dictionary" header.
      - Binary ru dict :
        Add a "dictionary" header and remove the same words as above.
      
      For all dictionaries : bump the version to 27.
      
      Change-Id: I94fe7f8f42b31fdad223085c00a94115e14d2276
  5. Sep 21, 2012
    • Use all Lexiteria sources and update existing dictionaries. · d0cf9649
      Jean Chalard authored
      New dictionaries :
      - Danish
      - Greek
      - Finnish
      - Lithuanian
      - Latvian
      - Dutch
      - Polish
      - Russian
      - Slovene
      - Serbian
      - Swedish
      - Turkish
      
      Also, compress those files to reduce their footprint in the
      repository.
      Also, update and improve the English and French dictionaries,
      and add the ligature shortcuts to the French dictionary.
      Finally, move the Russian binary dictionary here now that it
      can at last be open sourced.
      
      Bug: 5587752
      Bug: 6775251
      Bug: 6995793
      Bug: 7149666
      Change-Id: Iec9831d4dce425a2b5b0657571e4448436610525
  6. May 25, 2012