Language Detection

Table of contents

Description

This source offers the detection of the language of contents.

Responsible Person

Status

Status

Ready for use

Possible Values: Ready for Use, In Development, Plannend, Broken

Info

Aggregation Features

Fill

No

Update

No

Enrich

Yes

Source Code

Project: https://github.com/soziotech/CommunityMashup/tree/master/sources/LanguageDetectionSourceService

Tracking

0%

Planned Todos

  1. handler

    Add adapter to react on new contents.

    Priority MEDIUM
    i21bpela
    Jul 23, 2012
0%

Bugs

0%

Feature Request

Used Service

Name

Detect language

Link

http://detectlanguage.com/

Developer Portal

http://detectlanguage.com/

Authentication

Api Key

Limits:

5000 calls/day. Pro accounts available.

 

Used Libraries

Name

Version

License

Usage

Link

Apache Http Client

4.2.1

Apache License 2.0

Compiled

http://hc.apache.org/httpcomponents-client-ga/index.html

Apache Http Core4.2.1Apache Licence 2.0Compiledhttp://hc.apache.org/httpcomponents-core-ga/
Google gson2.2.2Apache Licence 2.0Compiledhttp://code.google.com/p/google-gson/

Possible Usage Values: Compiled, Source, Extended Source, Modified Source

Info

Configuration

Key

Description

Possible Values

Default

Required (Yes/No)

apiKey

This property is used to set the api key of the authentication at detectlanguage.com.

Text

empty

Yes

setLocaleDetermines if the locale attribute of the content should be set.true/falsetrueNo
languageMetaTagDetermines if the language should be added as meta tag.true/false No
minimumConfidenceThe minimum confidence needed for a language detection result to be accepted.0.0 .. 1.00.75No
neededMetaTagOnly contents with this tags will be processed.TextemptyNo

Example:

<sources bundleId="org.sociotech.communitymashup.source.LanguageDetectionSourceService" name="DetectLanguage">
   <configuration>
      <properties Key="apiKey" Value="-" />         
   </configuration>
</sources>

Description

The language of all contained contents will be detected, added as locale attribute as well as meta tag with the default confidence of 75%.

Meta-Informations

Type

Description

Value

Items

MetaTag

Added to all contents for wich the language could be detected with the given confidence.

language_detected

Contents

MetaTag

Added to all contents for wich the language could not be detected with the given confidence.

language_undetectedContents

Transformation

External

CommunityMashup

Language

  • Locale Attribute of Content
  • MetaTag added to Content