OpenCalais classifier

From WandoraWiki
(Difference between revisions)
Jump to: navigation, search
 
(11 intermediate revisions by one user not shown)
Line 1: Line 1:
 +
'''OpenCalais classifier has been retired in Wandora version 2015-08-06 after Calais' API changes.'''
 +
 
{| cellspacing="0" cellpadding="0" width="100%" background-color="transparent"  
 
{| cellspacing="0" cellpadding="0" width="100%" background-color="transparent"  
 
| style="border:none; margin:none; padding:0px;" valign="top" width="120" | <!-- LEFT COLUMN -->
 
| style="border:none; margin:none; padding:0px;" valign="top" width="120" | <!-- LEFT COLUMN -->
Line 5: Line 7:
 
OpenCalais classifier takes a text fragment and creates a topic for it, and associates created text topic with keywords given by [http://www.opencalais.com/ OpenCalais] web service. The effect is that given text is tagged with certain topics found in the text.
 
OpenCalais classifier takes a text fragment and creates a topic for it, and associates created text topic with keywords given by [http://www.opencalais.com/ OpenCalais] web service. The effect is that given text is tagged with certain topics found in the text.
  
Wandora's OpenCalais classifier starts with '''File > Extract > Classify > OpenCalais classifier...'''. Classifier can read given file, URL, or plain text.
+
Wandora's OpenCalais classifier starts with '''File > Extract > Classification > OpenCalais classifier...'''. Classifier can read given file, URL, or plain text.
  
[[Wandora Firefox plugin|Wandora's Firefox plugin]] can also access the OpenCalais classifier. You can send complete WWW page or just WWW page fragments to Wandora for OpenCalais classification while browsing the web.
+
[[Wandora Firefox plugin|Wandora's Firefox plugin]] can also access the OpenCalais classifier. You can send complete WWW page or just WWW page selections to Wandora for OpenCalais classification while browsing the web.
 
|}
 
|}
  
 
== OpenCalais classification example ==
 
== OpenCalais classification example ==
  
In our example user has started Wandora application and [[Embedded HTTP server|Wandora embedded HTTP server]]. Then user browses The New York Times web site and finds interesting text fragment discussing Senator Obama's speech. User selects the text fragment as shown below and starts Wandora Firefox plugin with option Classify with OpenCalais.
+
Although OpenCalais classifier works also within Wandora application, this example demonstrates OpenCalais classifier in Wandora's web browser extension. In our example user has started Wandora application and [[Embedded HTTP server|Wandora's embedded HTTP server]]. Then user browses The New York Times web site and finds interesting text fragment discussing Senator Obama's speech. User selects the text fragment (See image below) and starts Wandora Firefox plugin with option '''Classify with OpenCalais'''.
  
  
Line 20: Line 22:
 
Plugin sends the text to Wandora's OpenCalais classifier. Classifier passes the text to OpenCalais web service and receives keywords found in the text. Finally Wandora creates topics for
 
Plugin sends the text to Wandora's OpenCalais classifier. Classifier passes the text to OpenCalais web service and receives keywords found in the text. Finally Wandora creates topics for
  
* The text fragment
+
* The text selection
 
* WWW page the text fragment originates
 
* WWW page the text fragment originates
 
* Each keyword provided by the OpenCalais
 
* Each keyword provided by the OpenCalais
Line 28: Line 30:
  
 
[[Image:opencalais_example_2.gif|center]]
 
[[Image:opencalais_example_2.gif|center]]
 
  
 
== Additional notes ==
 
== Additional notes ==
Line 35: Line 36:
 
* OpenCalais classifier accepts only limited size texts.
 
* OpenCalais classifier accepts only limited size texts.
 
* OpenCalais classifies properly only English language texts.
 
* OpenCalais classifies properly only English language texts.
 +
 +
== See also ==
 +
 +
* [[Tagthe extractor]]
 +
* [[Yahoo! YQL term extractor]]
 +
* [[SemanticHacker classifier]]
 +
* [[AlchemyAPI extractors]]
 +
* [[Zemanta extractor]]
 +
* [[GATE/ANNIE integration|GATE/ANNIE]]
 +
* [[Stanford Named Entity Recognizer integration|Stanford Named Entity Recognizer (NER)]]
 +
* [[UClassify integration]]

Latest revision as of 11:54, 6 August 2015

OpenCalais classifier has been retired in Wandora version 2015-08-06 after Calais' API changes.

calais_logo_final_thomson_web_100.jpg

OpenCalais classifier takes a text fragment and creates a topic for it, and associates created text topic with keywords given by OpenCalais web service. The effect is that given text is tagged with certain topics found in the text.

Wandora's OpenCalais classifier starts with File > Extract > Classification > OpenCalais classifier.... Classifier can read given file, URL, or plain text.

Wandora's Firefox plugin can also access the OpenCalais classifier. You can send complete WWW page or just WWW page selections to Wandora for OpenCalais classification while browsing the web.

[edit] OpenCalais classification example

Although OpenCalais classifier works also within Wandora application, this example demonstrates OpenCalais classifier in Wandora's web browser extension. In our example user has started Wandora application and Wandora's embedded HTTP server. Then user browses The New York Times web site and finds interesting text fragment discussing Senator Obama's speech. User selects the text fragment (See image below) and starts Wandora Firefox plugin with option Classify with OpenCalais.


Opencalais example 1.gif


Plugin sends the text to Wandora's OpenCalais classifier. Classifier passes the text to OpenCalais web service and receives keywords found in the text. Finally Wandora creates topics for

  • The text selection
  • WWW page the text fragment originates
  • Each keyword provided by the OpenCalais

and links everything with reasonable associations. Below is a screenshot of Wandora after successful classification. It appears OpenCalais recognized keywords Obama and McCain in the given text fragment.


Opencalais example 2.gif

[edit] Additional notes

  • The quality of keywords provided by OpenCalais varies. Sometimes OpenCalais doesn't recognize obvious keywords in given text. Sometimes provided keyword set is very comprehensive.
  • OpenCalais classifier accepts only limited size texts.
  • OpenCalais classifies properly only English language texts.

[edit] See also

Personal tools