Eclipse Plugins, Bundles and Products - Eclipse Marketplace
LangID - a tool for language identification in SMILA 0.7

Details

LangID automatically identifies the language of any input text. It uses an n-gram approach to language identification, which makes it very fast. It distinguishes among 26 languages:

Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, German, Hungarian, Icelandic, Indonesian, Italian, Latvian, Lithuanian, Malay, Norwegian, Portuguese, Romanian, Serbian, Slovak, Slovenian, Spanish, Swedish

The Language Identifier can also learn profiles for new languages from a collection of language-specific documents. It can detect the language of a document from either its first 30 words or its whole content. Detection precision lies between 98% and 99.5%, depending on the profile size. Latency is about 6 ms when only the first 30 words of a document are considered.
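
The listing does not say which n-gram method LangID uses, so the following is only a minimal sketch of one common technique: rank-ordered character n-gram profiles compared with an "out-of-place" distance. Every name here (ngrams, learn_profile, distance, identify, TOY_SAMPLES) is hypothetical and is not part of LangID or SMILA, and the two training sentences are toy data. The sketch illustrates the two ideas from the description: learning a profile from language-specific documents, and classifying on the first 30 words or on the whole text.

```python
from collections import Counter


def ngrams(text, n_min=1, n_max=3):
    """Yield character n-grams of each word, padded with '_' at both ends."""
    for word in text.lower().split():
        padded = f"_{word}_"
        for n in range(n_min, n_max + 1):
            for i in range(len(padded) - n + 1):
                yield padded[i:i + n]


def learn_profile(documents, size=300):
    """Map the `size` most frequent n-grams to their rank (0 = most frequent)."""
    counts = Counter()
    for doc in documents:
        counts.update(ngrams(doc))
    return {gram: rank for rank, (gram, _) in enumerate(counts.most_common(size))}


def distance(doc_profile, lang_profile):
    """Out-of-place distance: rank differences, fixed penalty for unseen n-grams."""
    penalty = len(lang_profile)
    return sum(
        abs(rank - lang_profile[gram]) if gram in lang_profile else penalty
        for gram, rank in doc_profile.items()
    )


def identify(text, profiles, max_words=30):
    """Return the closest language; max_words=None uses the whole text."""
    words = text.split()
    if max_words is not None:
        words = words[:max_words]
    doc_profile = learn_profile([" ".join(words)])
    return min(profiles, key=lambda lang: distance(doc_profile, profiles[lang]))


# Toy training data (an assumption for illustration; real profiles need far more text).
TOY_SAMPLES = {
    "en": ["the quick brown fox jumps over the lazy dog and the cat sat on the mat with the other animals"],
    "de": ["der schnelle braune fuchs springt ueber den faulen hund und die katze sitzt auf der matte mit den anderen tieren"],
}
toy_profiles = {lang: learn_profile(docs) for lang, docs in TOY_SAMPLES.items()}
```

Calling identify(text, toy_profiles) classifies on the first 30 words, while identify(text, toy_profiles, max_words=None) uses the whole content. Limiting the input keeps the work per document small and roughly constant, which is one plausible reason a 30-word mode would be fast.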

Categories: EclipseRT Target Platform Components
Tags: SMILA, language identification
Additional Details
Organization Name: DFKI GmbH
Development Status: Production/Stable
Date Created: Wed, 2011-06-08 11:33
License: Commercial
Date Updated: Tue, 2012-01-10 05:21
Submitted by: Bogdan Sacaleanu
Metrics

Date            Ranking  Installs  Clickthroughs
March 2023      NA       0 (0%)    3
February 2023   NA       0 (0%)    16
January 2023    NA       0 (0%)    19
December 2022   NA       0 (0%)    16
November 2022   NA       0 (0%)    17
October 2022    NA       0 (0%)    17
September 2022  NA       0 (0%)    15
August 2022     NA       0 (0%)    6
July 2022       NA       0 (0%)    32
June 2022       NA       0 (0%)    15
May 2022        NA       0 (0%)    11
April 2022      NA       0 (0%)    9
Errors

Unsuccessful Installs in the last 7 Days: 0


Reviews

Thanks, how do I download it?

Submitted by Hugo Alberto Martinez Ortega on Fri, 2012-11-30 09:49
