New York Times Article Search API extractor
Wandora's New York Times Article Search API extractor performs an api request to New York Times Article Search API and transforms the response JSON to topics and associations. After successful extraction Wandora contains information about New York Times articles and extracted information can be visualized, exported and modified in Wandora.
In this example, Wandora user performs an article search to New York Times Article Search API with a query obama. First, user locates the extractor in File > Extract > News > New York Times Article Search API extractor and selects the menu option.
Wandora opens up an extractor dialog. Wandora user fills in a search query and selects fields to be extracted. Listed fields are a subset of fields described in New York Times Article Search API documentation. We experienced some difficulties in extracting too many fields: The number of result rows in response dropped down to one instead of default ten. Notice, you can always perform sequential extractions with a different set of fields. As Wandora merges article topics automatically, latter results merge nicely with earlier results. In this example user selects to extract fields author, body, byline, column_facet, date and title. Article url is added to the selected fields automatically by Wandora. Article url is used as a subject identifier of an article topic.
Wandora user clicks the Extract button and Wandora asks user's api-key. Wandora keeps your api-key stored in memory during a use session. Your api-key is not stored between use sessions. If you need to change the api-key during a use session, press the Forget api-key button in query dialog. Wandora asks the key again once you perform an extraction.
If Wandora gets a valid response, the application parses response JSON and transforms included entities to topics and associations. One response JSON contains information about 10 articles. Wandora notices if there is more articles available and asks how to proceed.
Available options are
- Do not extract any more pages.
- Extract only next page
- Extract next page
- Extract 10 next pages
- Extract all next pages
Wandora user should notice that the Article Search API usage is limited to 5000 requests per day and extracting one page takes one request. Thus, use option Extract all next pages carefully.
After successful extraction, Wandora user finds a topic New york Times API just below Wandora class. Opening the topic reveals extracted articles and used type topics.
Opening single article topic in Traditional topic panel views all information (i.e. selected fields) associated with the article.
For example, article body is modeled as an occurrence attached to the article topic.
By-lines are modeled as topics associated with the article topic. Similarly, facets are generally modeled as topics associated with the article topic.
All facet topics are typed with article-facet topic.
Opening one facet topic, say Letter, lists all articles that have the facet value.
Once you have performed an article search using the New York Times Article Search API extractor, you might be interested in classifying articles: