Some users have had some problems installing Cysgliad on some Windows 7 machines.
An update that resolves all these issues is now available for download from the Cysgliad website.
Log in to the Cysgliad website with your Cysgliad website account details and follow the links to the download the installer. If you don’t have or can’t remember your login details then click on the “Forgotten your login details?” link.
At the same time Google added Afrikaans, Belarussian, Iclandic, Irish, Macedonian, Malayan, Swahili and Yiddish. This brings the total amount of languages supported by Google Translator to 51.
These translations are produced by statistical approaches to machine translation - that is, Google uses the vast amounts of text on the web in order to find patterns between the words, and by using parallel texts of different languages are able to produce what appear to be translations.
The technology isn’t perfect yet, and one of our worries is that unscrupulous organisations and companies will misuse the technology to produce inferior Welsh language translations. Some organisations that claim conformance to the Welsh Language Act are likely to be very tempted to use a free automatic service such as this without a care for the legibility and linguistic correctness of their texts in Welsh. This of course was not the intended purpose of Google Translate : its aim is to provide a gist translation that allows a user to read and understand texts that are in otherwise unaccessable languages, and also to provide initial rough translation for a human translator to work on before being properly published.
However, Google have achieved quite a feat with their automatic translation for Welsh to English. Translating from English to Welsh is not as good, but it looks promising. The ability to translate between Welsh and other languages is a very useful feature, since the service translates between all possible language pairs of the 51 supported languages. It appears English is used as an intra-language in the translating, translating from Chinese for example shows the English word ordering in the Welsh translations.
We hope that this is only the beginning for Google and that further improvements will come. This could be good news for the translation industry in Wales. As we know, the demand for translations between Welsh and English is virtually inexhaustible. Improved tools for initial rough translations will allow translators to translate more texts in less time. This will keep costs down to reasonable limits. It could also mean that there could be no need to translate from Welsh to English anymore, since a gist translation might be sufficient. Will we still need to translate our own Murmur blog entries from Welsh to English from now on? We’ll see – but in the meantime, very warm congratulations to Google on their first efforts with Welsh-language translation.
We are very happy at the news that our Unit has been awarded a two year grant to establish a special interest group between industry and academia in Wales in the field of Speech and Language Technologies. (SALT)
The project will provide a boost not only to existing research and development in SALT but also to their commercial exploitation by Welsh industry and businesses both within a bilingual Wales and the multilingual world beyond.
We shall be assisting companies with an interest in the developing their products and services further with Speech and Language Technologies, and will be looking to identify projects which can be undertaken together with Welsh academia. The unit will also aid businesses in applying for funding from appropriate funding sources – a role that our unit has successfully undertaken many times in the past.
In addition the special interest group will allow us to organize a number of seminars for business and academia, where both international speakers and experts from Welsh universities will be invited to share from their expertise.
Keep an eye on this blog for detail of the launch event next month!
The Special interest group’s website is : http://www.saltcymru.org
Press Release : Wales a Living Laboratory
Speech and Language Technologies in the Clouds
Challenges and Opportunities from Cloud Computing
Guest speaker: Dewi Jones from the Language Technologies Unit Bangor University
Tuesday 3rd March
6.00pm – 8.00pm
(Doors open 5.30pm for refreshments)
There are a confusing number of definitions for ‘Cloud Computing’, some use ‘cloud’ as a metaphor for the internet as an abstraction of complex infrastructure that it conceals. The ‘cloud’ incorporates many recent software technology trends and buzzwords such as Software as a Service (SaaS) and ‘Web 2.0′ (even Web 3.0) in which a common theme is reliance on the Internet (rather than the personal computer) for satisfying the computing needs of users.This approach to delivery of applications and data storage has its fans and critics.
Dewi will present the challenges and opportunities cloud computing could have in store for their speech and language technologies research and development work at Bangor. Demonstrations of experimental web based applications, developed at the Unit, will be used as a basis for illustration and discussion.
The event is FREE to attend, however advance booking is essential
To book a place please email firstname.lastname@example.org
Alternatively you can contact Rob or Judith on 01248 388245
Following our update for Cysill 3.0, there is now an update for Cysgeir 1.0 available. If Cysgeir hasn’t notified you itself of the update, then choose from the main menu
- Help -> Check for Updates
- View -> Options -> Updates -> Check Now
In this Cysgeir update :
- enhancments to how you receive new an/or updated dictionaries
- startup speed of Cysgeir improved
- support for OpenOffice.org 3.0
After you’ve successfully updated the Cysgeir application, choose from the main menu:
- Help -> Check for Dictionary Updates
There are some corrections to the Cysgair dictionary (the comprehensive general language dictionaryin Cysgeir), a new version of the Termiadur (which supersedes Y Termiadur Ysgol) and the following new terminology dictionaries :
- Terms in the Health Care of Older People
- Terms for Occupational Therapy
An update for the home edition of Cysill 3.0 is now available. If Cysill hasn’t notified you itself of the update, then choose from the main menu
- Help -> Chwilio am ddiweddariad
In this update are
- additions to Cysill’s spelling lexicon and the grammar rules database.
- problems caused by question marks resolved. atryswyd problemau gyda marciau cwestiwn mewn dogfennau
- startup speed of Cysill improved
- support for OpenOffice.org 3.0
It’s been a busy few weeks for us. First of all we hosted the very succesful Global Understanding in Multilingual, Multimodal and Multimedia Contexts (GUM3C) conference here at Bangor. All the presentations were well-received, and our two keynote speakers, Christian Galinski and David Crystal, gave outstanding talks. The papers have been published on-line on the GUM3C website at http://www.GUM3C.org.
Last week Delyth Prys spoke at a conference on Minority Languages in Bilbao, the Basque Country. Like our own GUM3C Conference this was organised to celebrate the UNESCO Year of Languages, and this was another chance to meet with like-minded people from other countries. Meeting and talking to people engaged in developing Basque language resources was a special privilage, and we hope that we may work together in future on a joint Welsh/Basque software project. A special mention is due here to Luistxo Fernandez who also spoke at the conference and who keeps a bilingual Basque/English blog at http://eibar.org/blogak/luistxo/en. The Basque UNESCO conference website is at http://www.hizkuntzagutxituak.net/.
Moving on to a conference which is happening next month, we are looking forward to the LULCLII conference at Bolzano on the Italian/Austrian border (where Otzi the iceman is kept in a purpose-built museum). It is a very rare thing to find an academic conference entirely dedicated to Comuptational Linguistics for Lesser Used Languages, and four of our team attended the first such conference at Bolzano in 2005. Some of us hope to attend again this time, and the first draft of the conference programme has now been posted at http://www.eurac.edu/lulcl2. We’ll let you know how we get on there.
Two new terminology dictionaries which we helped compile were launched in the National Eisteddfod at the at the start of the month.
The Dictionary of Legal Terms is the result of collaboration between the Language Technologies Unit and the School of Law, Bangor University. It’s aimed principally at students studying the law through the medium of Welsh at university level, although it will also be useful to those studying the law at secondary school level, to members of the legal profession practising through the medium of Welsh, to translators and bilingual administrators, and to everyone interested in the law and the Welsh language. The dictionary contains over 1,000 terms with concise definitions in Welsh and English.
Geiriadur Termau’r Gyfraith / Dictionary of Legal Terms, ed. Dewi Llŷr Jones, Delyth Prys, Owain Lloyd Davies. Bangor University, 2008.
ISBN 978-1-84220-110-7. R.R.P. £5.
The Dictionary of Terms for the Creative Industries is the result of collaboration between the Language Technologies Unit and the Creative Industries Network Panel, Centre for Welsh Medium Higher Education. The first part of the dictionary, which deals chiefly with the field of New Media, is now available on-line at http://www.termau.org/cyfryngau and the aim is to add further entries to this on-line dictionary in the future.
The slides from the presentation delivered at ITWales are now available from the SALT Cymru website by following this link this link.
Thanks to all for the encouraging response, and we hope that our answers to your questions were useful!
I recently attended the recent LREC conference (”Language Resources and Evaluation Conference”), held in Marrakech, Morocco, May 28-30, and was involved in the following:-
- Organising and chairing the half-day workshop on “Collaboration: interoperability between people in the creation of language resources for less-resourced languages” on May 27th.
- Presenting a poster paper (co-authored with Rhys) on “Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language” (i.e. Welsh).
- Giving a short presentation to a post-conference workshop, speaking about the recent activities of the SALTMIL Special Interest Group.
I’ve also written a long review of the “EMU” speech database management software. The review has just been published in the purely-online journal “Language Documentation and Conservation“, and can be read online here. The software is relevant to less-resourced languages of all kinds.