dev/misc/Uk2GrCorpus.lha

Short: compact english-greek parallel aligned linguistic corpus
Author: olethros at geocities.com (Christos Dimitrakakis)
Uploader: olethros geocities com (Christos Dimitrakakis)
Type: dev/misc
Architecture: m68k-amigaos
Date: 2000-01-07
Download: http://aminet.net/dev/misc/Uk2GrCorpus.lha
Readme: http://aminet.net/dev/misc/Uk2GrCorpus.readme
Downloads: 617

A compact English-Greek parallel aligned linguistic corpus :) Use it for
automatic translation.

This is a relatively small uk2gr corpus, needed for an extension of my SimpTrans
project. Take a look if you are interested in raw data for automatic
translation projects. The corpus features around 1000 different words in each
language, so it should be nice. It includes some computer-related text at the
top, which is sentence-aligned, and some literature later on, which is
paragraph-aligned.

Paragraphs are denoted in a semi-standard, HTML-like way:
<P Lang=..>...stuff..</P>
denotes a paragraph in a language. Any number of translations can follow.
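
As a rough illustration of the paragraph markup described above, here is a
minimal Python sketch of a reader for it. It is not part of the archive. The
language codes "uk" and "gr", the sample sentences, and the rule that a
repeated language starts a new aligned group are all assumptions made for the
example; check them against uk2gr.corpus before relying on them.

```python
import re

# Matches <P Lang=xx>...</P>; the attribute value may be bare or quoted.
PARAGRAPH = re.compile(
    r'<P\s+Lang\s*=\s*"?([^\s">]+)"?\s*>(.*?)</P>',
    re.IGNORECASE | re.DOTALL,
)


def parse_corpus(text):
    """Return a list of dicts, each mapping a language code to one paragraph."""
    groups = []
    current = {}
    for lang, body in PARAGRAPH.findall(text):
        lang = lang.lower()
        # Assumption: seeing a language again means a new aligned group begins.
        if lang in current:
            groups.append(current)
            current = {}
        # Collapse line breaks and runs of spaces inside the paragraph.
        current[lang] = " ".join(body.split())
    if current:
        groups.append(current)
    return groups


# Hypothetical sample data, not taken from the real corpus.
sample = """
<P Lang=uk>Open the file.</P>
<P Lang=gr>Άνοιξε το αρχείο.</P>
<P Lang=uk>Save
the file.</P>
<P Lang=gr>Αποθήκευσε το αρχείο.</P>
"""

groups = parse_corpus(sample)
for group in groups:
    print(group)
```

Each returned dict holds one paragraph and its translations, so a group with
more than two languages needs no special handling.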

A short program will also appear that uses .ct files in various languages to
produce a fairly extensive computer-related aligned corpus. Such simple
computer-related corpora should be quite enough for the limited scope of
translating .ct files (?)
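
The promised program is not included here, so the following Python sketch only
illustrates the idea of turning .ct files into aligned paragraphs. It assumes a
simplified .ct layout: header lines start with "##", comment lines start with
";", and each message identifier is followed by its string on the next line.
Continuation lines are ignored, and the identifiers, strings, and language
codes in the sample are hypothetical.

```python
def parse_ct(text):
    """Map message identifiers to strings (simplified .ct layout, see above)."""
    entries = {}
    pending_id = None
    for line in text.splitlines():
        if pending_id is None:
            # Skip blank lines, "##" headers and ";" comments between entries.
            if not line.strip() or line.startswith(("##", ";")):
                continue
            pending_id = line.strip()
        else:
            # The line right after an identifier is taken as its string.
            entries[pending_id] = line
            pending_id = None
    return entries


def align(ct_by_lang):
    """Emit <P Lang=..> paragraphs for identifiers present in every language."""
    langs = list(ct_by_lang)
    shared = set.intersection(*(set(e) for e in ct_by_lang.values()))
    lines = []
    for msg_id in sorted(shared):
        strings = [ct_by_lang[lang][msg_id] for lang in langs]
        # Drop entries where any language has an empty (untranslated) string.
        if all(s.strip() for s in strings):
            for lang, s in zip(langs, strings):
                lines.append(f"<P Lang={lang}>{s}</P>")
    return "\n".join(lines)


# Hypothetical catalog translations, for illustration only.
uk_ct = """## version $VER: demo.catalog 1.0
## language english
;
MSG_OPEN
Open
MSG_QUIT
Quit
MSG_HELP
Help
"""

gr_ct = """## version $VER: demo.catalog 1.0
## language greek
;
MSG_OPEN
Άνοιγμα
; Open
MSG_QUIT
Έξοδος
"""

corpus = align({"uk": parse_ct(uk_ct), "gr": parse_ct(gr_ct)})
print(corpus)
```

Matching on the message identifier gives the alignment for free, which is why
catalog translations are a convenient source of short aligned text.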

Look at other weird stuff from me:

/dev/basic/bgp.lha            -  Blitz Genetic-Programming Environment
/dev/basic/gasp.lha           -  Genetic Algorithm Sample Packer in Blitz
/dev/misc/SimpTrans.lha       -  Simple Automatic Translation of .ct files!
/dev/misc/ST_uk2gr.lha        -  Uk2Gr phrasebook for SimpTrans
/dev/misc/YAAV.lha            -  YAAV V0.99 - Yet Another Autodoc Viewer
/game/shoot/allrox.lha        -  HardCore FRENETIC shootem'up - HARD! FAST! OUCH!
/game/actio/wreckage.lha      -  Experimental Driving Sim
/game/actio/wreckage_src.lha  -  Source for driving sim (older version :)
/util/pack/agf.lha            -  AGF V0.9 n*8-bit Sample Pre-Packing Processor


Olethros
--------


Contents of dev/misc/Uk2GrCorpus.lha
 PERMSSN    UID  GID    PACKED    SIZE  RATIO     CRC       STAMP          NAME
---------- ----------- ------- ------- ------ ---------- ------------ -------------
[generic]                20535   45061  45.6% -lh5- ab20 Jan  5  2000 uk2gr.corpus
---------- ----------- ------- ------- ------ ---------- ------------ -------------
 Total         1 file    20535   45061  45.6%            Jan  7  2000
