watson_developer_cloud.natural_language_understanding_v1 module¶

Analyze various features of text content at scale. Provide text, raw HTML, or a public URL and IBM Watson Natural Language Understanding will give you results for the features you request. The service cleans HTML content before analysis by default, so the results can ignore most advertisements and other unwanted content. You can create [custom models](/docs/services/natural-language-understanding/customizing.html) with Watson Knowledge Studio to detect custom entities and relations in Natural Language Understanding.

class NaturalLanguageUnderstandingV1(version, url='https://gateway.watsonplatform.net/natural-language-understanding/api', username=None, password=None, iam_apikey=None, iam_access_token=None, iam_url=None)[source]¶

Bases: watson_developer_cloud.watson_service.WatsonService

The Natural Language Understanding V1 service.

default_url = 'https://gateway.watsonplatform.net/natural-language-understanding/api'¶

analyze(features, text=None, html=None, url=None, clean=None, xpath=None, fallback_to_raw=None, return_analyzed_text=None, language=None, limit_text_characters=None, **kwargs)[source]¶

Analyze text, HTML, or a public webpage.

Analyzes text, HTML, or a public webpage with one or more text analysis features, including categories, concepts, emotion, entities, keywords, metadata, relations, semantic roles, and sentiment.

Parameters:	features (Features) – Specific features to analyze the document for. text (str) – The plain text to analyze. One of the text, html, or url

parameters is required. :param str html: The HTML file to analyze. One of the text, html, or url parameters is required. :param str url: The web page to analyze. One of the text, html, or url parameters is required. :param bool clean: Remove website elements, such as links, ads, etc. :param str xpath: An [XPath query](https://www.w3.org/TR/xpath/) to perform on html or url input. Results of the query will be appended to the cleaned webpage text before it is analyzed. To analyze only the results of the XPath query, set the clean parameter to false. :param bool fallback_to_raw: Whether to use raw HTML content if text cleaning fails. :param bool return_analyzed_text: Whether or not to return the analyzed text. :param str language: ISO 639-1 code that specifies the language of your text. This overrides automatic language detection. Language support differs depending on the features you include in your analysis. See [Language support](https://www.bluemix.net/docs/services/natural-language-understanding/language-support.html) for more information. :param int limit_text_characters: Sets the maximum number of characters that are processed by the service. :param dict headers: A dict containing the request headers :return: A DetailedResponse containing the result, headers and HTTP status code. :rtype: DetailedResponse

delete_model(model_id, **kwargs)[source]¶

Delete model.

Deletes a custom model.

Parameters:	model_id (str) – model_id of the model to delete. headers (dict) – A dict containing the request headers
Returns:	A DetailedResponse containing the result, headers and HTTP status code.
Return type:	DetailedResponse

list_models(**kwargs)[source]¶

List models.

Lists available models for Relations and Entities features, including Watson Knowledge Studio custom models that you have created and linked to your Natural Language Understanding service.

Parameters:	headers (dict) – A dict containing the request headers
Returns:	A DetailedResponse containing the result, headers and HTTP status code.
Return type:	DetailedResponse

class AnalysisResults(language=None, analyzed_text=None, retrieved_url=None, usage=None, concepts=None, entities=None, keywords=None, categories=None, emotion=None, metadata=None, relations=None, semantic_roles=None, sentiment=None)[source]¶

Attr str language:
	(optional) Language used to analyze the text.
Attr str analyzed_text:
	(optional) Text that was used in the analysis.
Attr str retrieved_url:
	(optional) URL that was used to retrieve HTML content.
Attr Usage usage:
	(optional) API usage information for the request.
Attr list[ConceptsResult] concepts:
	(optional) The general concepts referenced or

Attr float score:
Attr str label:	(optional) The path to the category through the taxonomy hierarchy.
	(optional) Confidence score for the category classification. Higher

Attr float relevance:
Attr str text:	(optional) Name of the concept.
	(optional) Relevance score between 0 and 1. Higher scores

Attr str dbpedia_resource:
Attr str name:	(optional) Common entity name.
	(optional) Link to the corresponding DBpedia resource.
Attr list[str] subtype:
	(optional) Entity subtype information.

Attr bool mentions:
Attr int limit:	(optional) Maximum number of entities to return.
	(optional) Set this to true to return locations of entity

Attr float relevance:
Attr str type:	(optional) Entity type.
Attr str text:	(optional) The name of the entity.
	(optional) Relevance score from 0 to 1. Higher values indicate

Attr list[int] location:
Attr str text:	(optional) Entity mention text.
	(optional) Character offsets indicating the beginning and

Attr bool sentiment:
Attr int limit:	(optional) Maximum number of keywords to return.
	(optional) Set this to true to return sentiment information

Attr list[Author] authors:
	(optional) The authors of the document.
Attr str publication_date:
	(optional) The publication date in the format ISO 8601.
Attr str title:	(optional) The title of the document.
Attr str image:	(optional) URL of a prominent image on the webpage.
Attr list[Feed] feeds:
	(optional) RSS/ATOM feeds found on the webpage.

Attr str status:
	(optional) Shows as available if the model is ready for use.
Attr str model_id:
	(optional) Unique model ID.
Attr str language:
	(optional) ISO 639-1 code indicating the language of the model.
Attr str description:
	(optional) Model description.

Attr list[SemanticRolesEntity] entities:
Attr str text:	(optional) Text that corresponds to the subject role.
	(optional) An array of extracted entities.
Attr list[SemanticRolesKeyword] keywords:
	(optional) An array of extracted keywords.

Attr str normalized:
Attr str text:	(optional) Analyzed text that corresponds to the action.
	(optional) normalized version of the action.
Attr SemanticRolesVerb verb:
	(optional)

Attr list[SemanticRolesKeyword] keywords:
Attr str text:	(optional) Object text.
	(optional) An array of extracted keywords.

Attr bool keywords:
Attr int limit:	(optional) Maximum number of semantic_roles results to return.
	(optional) Set this to true to return keyword information for

Attr str text:	(optional) The keyword text.
Attr str tense:	(optional) Verb tense.

Attr DocumentSentimentResults document:
	(optional) The document level sentiment.
Attr list[TargetedSentimentResults] targets:
	(optional) The targeted sentiment to

Attr EmotionScores emotion:
Attr str text:	(optional) Targeted text.
	(optional) An object containing the emotion results for

Attr float score:
Attr str text:	(optional) Targeted text.
	(optional) Sentiment score from -1 (negative) to 1 (positive).

Attr int features:
	(optional) Number of features used in the API call.
Attr int text_characters:
	(optional) Number of text characters processed.
Attr int text_units:
	(optional) Number of 10,000-character units processed.