First page Back Continue Last page Image

INormalizer

class INormalizer(Interface):

"""Provides method to normalize lists of tokens"""

def getNormalizedTokenItemList(token_list, sample_url):

"""Returns a list of key, value pairs representing the

normalized tokens using a specific normalizer.

sample_url –- a URL to a text input which defines

the input sample to learn from, eventually

in the form of a URL to a web service

"""


Notes:

The INormalizer interface is similar to the IClassifier interface. It uses the same way to initialize machine learning. It returns a list of normalized colum names, or list of normalized tag values. The concept of “normalized tag” refers to the idea that synonims should be unified into the sale value.