Data Quality - Name and Address Management and Cleansing

Dataquality Metrics

About

Geocoding is an important part of the name and address management and sometimes the same with the only difference that Geocoding don't care of the standardization of the name.

Name and Address Cleansing

Name and address cleansing is a group of transformations performed on data containing individual and business names, as well as domestic and foreign addresses, for the purpose of improving the quality of data. Such transformations are usually referred to as parsing, standardization, correction, and augmentation.

  • Name/address parsing is the breakdown of non-discrete input into discrete name or address components.
  • Name/address standardization is the modification of components to a standard version acceptable to a postal service or suitable for record matching. For example ‘Street’ and ‘Str’ are standardized to ‘St’.
  • Postal correction involves matching an input address with postal database entries to verify and/or correct an address.
  • Augmentation adds derived information to the data, such as gender based on name, or collection of census, and geo-location data.

Patterns

Name and Address parsing, like any other type of parsing, depends on identification of keywords and patterns containing those keywords for instance :

  • Streets
    • Building numbers
    • Building names
    • Post boxes
    • Street names
    • Street junctions
    • Industrial Zones
    • Route Numbers (i.e.: major roads)
  • Areas
    • Multiple areas
  • Postcode

Standard

Development





Discover More
Dataquality Metrics
Data Quality - Data Rules

Data rules are rule that can have various designations such as: business rules (in the data modeling), data test, quality screen. They follow the same concept than the rules from an event driven...
Dataquality Metrics
Data Quality - Verification with an external directory website (Scrapping)

The data quality often must use external database to control the validation of the data. It's often the case with the address cleaning. And what better tools that all the data that you can find on the...
Compiler
Lexical Analysis - Parser (Syntax analysis|Linter)

A parser create a parse tree data structure from a series of token created by the lexer. The creation of the tree is based on the rules declared in the grammar (which defines the syntactic structure of...
Card Puncher Data Processing
ODI - Component Details

Oracle Data Integrator is made up of the components described below: The modular Repository, made up of a Master Repository and of one or several Work Repositories. These repositories can be installed...
Card Puncher Data Processing
ODIEE - Oracle Data Quality

Oracle Data Quality for Oracle Data Integrator is a data quality platform. Its powerful rule-based engine and its robust and scalable architecture places data quality and name & address cleansing...
Name And Adress Result
OWB - Name and Adress Operator

The name and address operator is a component of the name and address management. You say : what you know on your data what you want as parse data Select one of the following parsing types:...



Share this page:
Follow us:
Task Runner