Import Upstream version 0.50-2

parents
This diff is collapsed.
This package was designed by Thierry Vignaud <tvignaud@mandrakesoft.com>
(c) 1999-2000 MandrakesSoft
It's released under GPL (GNU Public Licence) :
This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
# Generated with Aspell Dicts "proc" script version 0.50.1
lang = br
version = 0.50-2
cwl_files = br.cwl
data_files = br.dat br_phonet.dat
doc_files = README COPYING Copyright
extra_files = configure info Makefile.pre
multi_files = br.multi breton.alias brezhoneg.alias
rws_files = br.rws
distdir=aspell-${lang}-${version}
all: ${rws_files} ${data_files}
install: all
mkdir -p ${DESTDIR}${dictdir}/
cp ${rws_files} ${multi_files} ${DESTDIR}${dictdir}/
cd ${DESTDIR}${dictdir}/ && chmod 644 ${rws_files} ${multi_files}
mkdir -p ${DESTDIR}${datadir}/
cp ${data_files} ${DESTDIR}${datadir}/
cd ${DESTDIR}${datadir}/ && chmod 644 ${data_files}
clean:
rm -f ${rws_files}
uninstall:
-cd ${DESTDIR}${dictdir}/ && rm ${rws_files} ${multi_files} ${link_files}
-cd ${DESTDIR}${datadir}/ && rm ${data_files}
dist:
perl proc
./configure
@make dist-nogen
dist-nogen:
-rm -r ${distdir}
mkdir ${distdir}
cp -p ${extra_files} ${cwl_files} ${multi_files} ${doc_files} ${data_files} ${distdir}/
-cp -pr doc/ ${distdir}/
gtar cfj ${distdir}.tar.bz2 ${distdir}/
rm -r ${distdir}/
distcheck:
tar xfj ${distdir}.tar.bz2
cd ${distdir} && ./configure && make
rel:
mv ${distdir}.tar.bz2 ../rel
br.rws: br.cwl
.SUFFIXES: .cwl .rws
.cwl.rws:
${WORD_LIST_COMPRESS} d < $< | ${ASPELL} ${ASPELL_FLAGS} --lang=br create master ./$@
GNU Aspell Breton Word List Package
Version 0.50-2
2002-07-28
by Kevin Atkinson (kevina@gnu.org)
Word List by:
Thierry Vignaud (tvignaud at mandrakesoft com)
Copyright Terms: GPL
see the file Copyright for the exact copyright.
Source Verson: 0.2.1-2mdk
This is the Aspell word list for Breton.
If Aspell is installed and aspell and word-list-compress are all
in the path first do a:
./configure
Which should output something like:
Finding Dictionary file location ... /usr/local/lib/aspell
Finding Data file location ... /usr/local/share/aspell
if it did not something likely went wrong.
After that build the package with:
make
and then install it with
make install
If any of the above mentioned programs are not in your path than the
variables, ASPELL and/or WORD_LIST_COMPRESS need to be set to the
commands (with path) to run the utilities. These variables may be set
in the environment before configure is run or specified at the command
line using the following syntax
./configure --vars VAR1=VAL1 ...
Other useful variables configure recognizes are ASPELL_PARMS, and DESTDIR.
To clean up after the build:
make clean
To uninstall the files:
make uninstall
After the word lists are installed you can use the main one (br) by
setting the LANG environmental variable to br or running Aspell
with --lang=br. You may also chose the dictionary directly
with the -d or --master option of Aspell. You can chose from any of
the following dictionaries:
br (breton brezhoneg)
Whereas the names in parentheses are alternate names for the
dictionary preceding the parentheses.
If you have any problem with installing or using the word lists please
let me (Kevin Atkinson) know.
If you have problems with the actual word lists please contact one of
the Word lists authors as I do not maintain the actual Word Lists.
Any additional documentation that came with the original word list can
be found in the doc/ directory.
This source diff could not be displayed because it is too large. You can view the blob instead.
# Generated with Aspell Dicts "proc" script version 0.50.1
name br
charset iso8859-1
special ' -*- . -*- - -*-
soundslike br
# Generated with Aspell Dicts "proc" script version 0.50.1
add br.rws
# phonetic_brezhoneg.h - phonetic transformation rules for use with phonetic.c
# Copyright (C) 2000 Rémi Vanicat
#
# This library is free software; you can redistribute it and/or
# modify it under the terms of the GNU General Public License
# version 2 as published by the Free Software Foundation;
#
# This library is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
# Lesser General Public License for more details.
#
# You should have received a copy of the GNU Lesser General Public
# License along with this library; if not, write to the Free Software
# Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
#
# Thierry Vignaud may be reached by email at tvignaud@mandrakesoft.com
#
version brezhoneg 0.0000000001
#EMME ~ AME
AI E
AN(AEUIO)- AM
AN A
AMM AM
AM(AEUIO)- AM
AM A
AU O
A A
 A
À A
BB P
B P
Ç S
C(EI)- S
CU(EI)- K
CC(EI)- X
CC K
CH CH
C K
DD T
D T
EMMENTAL EMATAL
EMMENTHAL EMATAL
EM(AEIOU)- EM
EM A
ET$ E
EU E
EN(AEUIO)- EM
EN A
ER$ E
EO O
EAU O
E E
È E
É E
Ê E
F F
G(EIY)- J
GU(EIY)- G
G G
H _
I I
Î I
J J
KS X
K K
LL L
L L
MM M
M M
NN M
N M
OEU E
OU U
OÙ U
O O
Ô O
PP P
PH F
P P
QU K
Q K
RR R
R R
S$ _
SS S
S S
TT T
T T
U U
Ù U
Û U
V V
W W
X X
Y(AEOU)- IL
Y I
ZZ S
Z S
#The rules in a different view:
#
# Exceptions:
#
# Beginning of word: "gn", "kn-", "pn-", "wr-" ----> drop first letter
# "Aebersold", "Gnagy", "Knuth", "Pniewski", "Wright"
#
# Beginning of word: "x" ----> change to "s"
# as in "Deng Xiaopeng"
#
# Beginning of word: "wh-" ----> change to "w"
# as in "Whalen"
# Beginning of word: leading vowels are transformed to "*"
#
# "[crt]ough" and "enough" are handled separately because of "F" sound
#
#
# A --> A at beginning
# _ otherwise
#
# B --> B unless at the end of word after "m", as in "dumb", "McComb"
#
# C --> X (sh) if "-cia-" or "-ch-"
# S if "-ci-", "-ce-", or "-cy-"
# SILENT if "-sci-", "-sce-", or "-scy-", or "-cq-"
# K otherwise, including in "-sch-"
#
# D --> K if in "-dge-", "-dgy-", or "-dgi-"
# T otherwise
#
# E --> A at beginnig
# _ SILENT otherwise
#
# F --> F
#
# G --> SILENT if in "-gh-" and not at end or before a vowel
# in "-gn" or "-gned" or "-gns"
# in "-dge-" etc., as in above rule
# K if before "i", or "e", or "y" if not double "gg"
#
# K otherwise (incl. "GG"!)
#
# H --> SILENT if after vowel and no vowel or "Y" follows
# or after "-ch-", "-sh-", "-ph-", "-th-", "-gh-"
# or after "rh-" at beginning
# H otherwise
#
# I --> A at beginning
# _ SILENT otherwise
#
# J --> K
#
# K --> SILENT if after "c"
# K otherwise
#
# L --> L
#
# M --> M
#
# N --> N
#
# O --> A at beginning
# _ SILENT otherwise
#
# P --> F if before "h"
# P otherwise
#
# Q --> K
#
# R --> SILENT if after vowel and no vowel or "Y" follows
# R otherwise
#
# S --> X (sh) if before "h" or in "-sio-" or "-sia-"
# SK if followed by "ch(eo)" (SCH(EO))
# S otherwise
#
# T --> X (sh) if "-tia-" or "-tio-"
# 0 (th) if before "h"
# silent if in "-tch-"
# T otherwise
#
# U --> A at beginning
# _ SILENT otherwise
#
# V --> V if first letter of word
# F otherwise
#
# W --> SILENT if not followed by a vowel
# W if followed by a vowel
#
# X --> KS
#
# Y --> SILENT if not followed by a vowel
# Y if followed by a vowel
#
# Z --> S
# Generated with Aspell Dicts "proc" script version 0.50.1
add br.multi
# Generated with Aspell Dicts "proc" script version 0.50.1
add br.multi
#!/bin/sh
# Note: future version will have a syntax something like
# ./configure [OPTIONS]
# Where OPTIONS is any of:
# --help
# --codes CODE1 ...
# --sizes SIZE1 ...
# --jargons JARGON1 ...
# --extras EXTRA1 ...
# --vars VAR1=VAL1 ...
# which is why I warn when --vars is not used before VAR1=VAL1
# Avoid depending upon Character Ranges.
# Taken from autoconf 2.50
cr_az='abcdefghijklmnopqrstuvwxyz'
cr_AZ='ABCDEFGHIJKLMNOPQRSTUVWXYZ'
cr_09='0123456789'
cr_alnum=$cr_az$cr_AZ$cr_09
# also taken form autoconf
case `echo "testing\c"; echo 1,2,3`,`echo -n testing; echo 1,2,3` in
*c*,-n*) ECHO_N= ECHO_C='
' ECHO_T=' ' ;;
*c*,* ) ECHO_N=-n ECHO_C= ECHO_T= ;;
*) ECHO_N= ECHO_C='\c' ECHO_T= ;;
esac
mode=none
for option
do
case $option in
--vars)
mode=vars
;;
*=*)
if test $mode != vars; then
echo "Warning: future versions will require --vars before variables are set"
mode=vars
fi
# Taken from autoconf 2.50
envvar=`expr "x$option" : 'x\([^=]*\)='`
# Reject names that are not valid shell variable names.
expr "x$envvar" : ".*[^_$cr_alnum]" >/dev/null &&
{ echo "$as_me: error: invalid variable name: $envvar" >&2
{ (exit 1); exit 1; }; }
optarg=`echo "$optarg" | sed "s/'/'\\\\\\\\''/g"`
eval "$envvar='$optarg'"
export $envvar
;;
--help)
echo "Usage: ./configure [--help | --vars VAR1=VAL1 ...]"
echo " Note: Variables may also be set in the environment brefore running config"
echo " Useful vars: ASPELL ASPELL_PARMS WORD_LIST_COMPRESS DESTDIR"
exit 0
;;
*)
echo "Error: unrecognized option $option";
exit 1
;;
esac
done
if test x = "x$ASPELL"
then ASPELL=aspell; fi
if test x = "x$WORD_LIST_COMPRESS"
then WORD_LIST_COMPRESS=word-list-compress; fi
echo $ECHO_N "Finding Dictionary file location ... $ECHO_C"
dictdir=`$ASPELL dump config dict-dir`
echo $dictdir
echo $ECHO_N "Finding Data file location ... $ECHO_C"
datadir=`$ASPELL dump config data-dir`
echo $datadir
echo "ASPELL = $ASPELL" > Makefile
echo "ASPELL_FLAGS = $ASPELL_FLAGS" >> Makefile
echo "WORD_LIST_COMPRESS = $WORD_LIST_COMPRESS" >> Makefile
echo "DESTDIR = $DESTDIR" >> Makefile
echo "dictdir = $dictdir" >> Makefile
echo "datadir = $datadir" >> Makefile
echo >> Makefile
cat Makefile.pre >> Makefile
Philippe Argouarch <argouarch@earthlink.net>
Thierry Vignaud <tvignaud@mandrakesoft.com>
c'hwirinial ??
chital, chinkal (from chintal)
diantao evit d(a v)ihan(ana)tav (kl).
diantao evit d(a v)ihan(aa)tav (kl).
[vannetais] : etal jol
flag for 'fagod str. (-o) -enn (-o)'
jabotig-ruz jabodig- g. -o-
jedot jilot g. -o
This diff is collapsed.
* Wed Jan 10 2001 Thierry Vignaud <tvignaud@mandrakesoft.com> 0.2.1
- initial aspell support framework
- more docs
- Makefile cleaning
* Wed Jan 10 2001 Thierry Vignaud <tvignaud@mandrakesoft.com> 0.2
- merged my word list with Philippe Argouarch's one : we'v 33281 words now.
* Thu Nov 25 1999 Thierry Vignaud <tvignaud@mandrakesoft.com> 0.1.2
- more words (1000<)
* Wed Nov 17 1999 Thierry Vignaud <tvignaud@mandrakesoft.com> 0.1.1
- more words
* Tue Nov 05 1999 Thierry Vignaud <tvignaud@mandrakesoft.com> 0.1
- First version (mainly town names and words that i remember)
mkdir -p /usr/lib/ispell
install -m 644 brezhoneg.aff /usr/lib/ispell/brezhoneg.aff
install -m 644 brezhoneg.hash /usr/lib/ispell/brezhoneg.hash
cd /usr/lib/ispell
ln -s brezhoneg.aff bzh.aff
ln -s brezhoneg.hash bzh.hash
ln -s brezhoneg.aff breton.aff
ln -s brezhoneg.hash breton.hash
ln -s brezhoneg.aff br.aff
ln -s brezhoneg.hash br.hash
mv -f brezhoneg.hash brezhoneg.ispell
buildhash brezhoneg.ispell brezhoneg.aff brezhoneg.hash &> /dev/null
rm -f brezhoneg.ispell.stat brezhoneg.ispell.cnt brezhoneg.ispell
This package was designed by Thierry Vignaud <tvignaud@mandrakesoft.com>
(c) 1999-2000 MandrakesSoft
It's released under GPL (GNU Public Licence) :
This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
BRETON (BREZHONEG) ISPELL DICTIONNARY
GERIADUR BREZHONEK EVIT ISPELL
DESCRIPTION :
-------------
This is a breton dictionary for ispell.
To check spelling in brezhoneg text (Latin1 aka ISO-8859 -
though the C'H and CH letters are not present in Latin1 :-) ),
HOW TO USE :
------------
use the following command:
ispell -d brezhoneg <file_name>
If you always want to use breton dictionnay, then add the following
to your ~/.bashrc
export DICTIONARY=brezhoneg.hash
Then the 'brezhoneg' dictionary will be used for spell checking
when using ispell. Please consult the documentation on ispell
for further details.
Ispell will then store additions to your personal dictionary in the
file ~/.ispell_brezhoneg. Please mail this to me, to be added back to the
main dictionary file. I shall endeavour to release updates as frequently
as possible.
MAKEDICT program can be used to construct (or extend) dictionary
by exploring russian texts. This program extracts russian word from
standard input, inserts them into hash table and then tries to
apply some heuristic rules to produce new forms of each word.
Only words with occurrence count less than some predefined value
depending on the length of the word are ignored (assumed to be mistyping).
Currently we reject all words with counter less than
MIN_COUNT + average_count(length(word))/count(word)/DISPERSION
where DISPERSION and MIN_COUNT parameters can be specified from command line.
Default value for MIN_COUNT is 2 and for DISPERSION is 50.
If "-check" option is specified special checking algorithm is used to
throw away all possible mistypings. We use the following rule:
if word with counter less than checking threshold specified by "-check"
option can be transformed by adding, removing or replacing any one letter
to another word which has counter DISPERSION times bigger, then
first word is removed from dictionary (assumed to be mistyping).
To implement this algorithm another hash table is used. For each word
two new elements are inserted in this hash table with hash values
calculated for two half of the word (so if error is in one half
of the word then hash values for another part will be equal with
correspondent hash value calculated for correct word).
Rules which are used for generation of new forms of a word are not real
rules of russian language word declension but
some heuristics allowing to produce most frequently used forms and
possibly not generate words with mistakes
(sometimes it is difficult to distinguish forms of nouns,
verbs and adjectives). To choose concrete rule algorithm tries to locate
specified suffixes in the word. If suffix is found then we construct
all forms of word using suffixes from rule and root of the word and
look for them into the dictionary. The rule with the biggest number of
matches is chosen (but only if number of matches is bigger than
MIN_MATCH parameter). Default value for MIN_MATCH is 2.
At the final stage program outputs words from hash table to standard output
using ISPELL affix rules as well as file with affix rules (if option -aff
is specified). Dictionary and affix files can be used by BUILDHASH
utility to create hash file for ISPELL.
Below is small description of MAKEDICT command line options:
-alt assume input is in alternative DOS character set (default)
-koi assume input in in KOI-8 character set
-aff generate affix file "russian.aff"
-wc output count of occurrence for each word
-stat output statistics for average word occurrence
-read read list of words with counters (see below)
-mincount <number> specify value of MIN_COUNT
-maxcount <number> specify miximal count of word to be placed in dictionary
-match <number> specify value of MIN_MATCH
-check <number> specify checking threshold
-disp <number> specify value of DISPERSION
As far as scanning all input sources takes a lot of time,
it is useful to create once list of all words with counters of occurrences
and then read only this file (it is much faster). List of all words
with occurrence counters can be produced by MAKEDICT with the following options
MAKEDICT -wc -count 0 -match 100 < input-source > word-list-file
You can then perform various experiment with different values of
parameters and list of rules, using produced file with word list as input:
MAKEDICT -read ... < word-list-file > dictionary
Dictionary produced by MAKEDICT utility can be sort by 'sort' utility.
TODO:
-----
* support mutations (urgent)
* support of hen-brezoneg ("brezhoneg kozh" aka "old breton"), les
graphies KLT, ZH (peurunvan aka "breton totally united"), Skolveurieg
(aka "university breton") [maybe] and etrerannyezhel (aka
"super-united breton")
Of course, current is _peurunvan_.
* more (or less?) utils
IN PROGRESS:
------------
* clean Makefile
* [RPM] provide aspell support too [PARTIAL]
DONE:
-----
* more docs
* [RPM] split aspell & ispell packages
* [RPM] ispell-br => spell-br
* [RPM] move scripts from %post to %install
* [RPM] major spec cleaning and simplification
- use more macros
- move all post{un,} scripts in %%install
* clean word list for lord BuildHash
name_english Breton
name_ascii Brezhoneg
lang br
author:
name Thierry Vignaud
email tvignaud at mandrakesoft com
copyright GPL
version 0.50-2
source-version 0.2.1-2mdk
charset iso8859-1
special ' -*- . -*- - -*-
soundslike phonet
alias br breton brezhoneg
dict:
name br
add br
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment