Logical Data Modeling - Entity Set (Class, Table)

Data System Architecture

About

An entity set is a set of entity.

An entity set is implemented:

  • in a relational database by a table and an entity by a row
  • in code by a Class and an instance by an object

Erd Entity Instance

Not to confound with the entity of a language

Categories/classification of entities

At a high level:

  • transaction entities correspond to fact tables
  • component entities correspond to dimension tables
  • classification tables correspond to hierarchy tables

Transaction/Facts Entities

These entities record details of business events (e.g., orders, shipments, payments, insurance claims, bank transactions, hotel bookings, airline reservations, and hospital admissions).

Generally, this tables are at the many end of a all one-to-many relationship.

Fact Dimension / Master detail transaction table

One exception to this rule is a fact dimension which has a one-to-many relationship (master detail) with the facts entities but still describe business events.

For instance, an Order Header and an Order Item table describe the same business event but with different levels of detail. The Order item table has the more detailed data whereas the Order header is grouped by order number and therefore its attributes can be in a dimensional model degenerated

This transaction entities are connected in a master-detail structure, they should be combined into a single fact table, as they represent different views of the same event. The split into “master” and “detail” is simply a requirement of normalization (1NF).

All attributes of the master record should be allocated down to the item level if possible (Kimball, 1995). For example, if discount is defined at the master (Order) level, the total discount amount should be allocated at the item level (e.g., in proportion to the price for each item). The same should be done to delivery charges, order level taxes, fees, etc.

Snapshot Entities

This entities recording a static level of some commodity at a point in time (e.g., account balances, customer sold and inventory levels). These do not record business events as such, but the effect of events on the state of an entity.

For instance, a stock level table which records a static level.

Component Entities

These entities are directly related to a transaction entity by a one-to-many relationship.

Classification / Lookup Entities

These entities are related to a component entity by a chain of one-to-many relationships. These define embedded hierarchies in the data model and are used to classify component entities.

Others

Some of the entities do not fit into any of these categories. Such entities do not fit the hierarchical structure of a dimensional model and therefore cannot be represented in the form of a star schema and must be “dimensionalized”.

Dates and Times Dimension

Date and/or Time appear as explicit dimensions in most star schemas to support different types of historical analyses. These are not normally represented as entities in operational systems.

Date and Time are usually represented as separate dimensions to reduce the size of the dimension tables (Kimball, 1996).

Dates must be explicitly modeled in a dimensional schema, whereas at the operational level they are represented as data types.

Other Notations

Mathematical Notation: A set

The mathematical concept of a set is just a group of unique items, meaning that the group contains no duplicates.

Programming language: A collection

A (Collection|Container) Data Type (Set, Bag, Sequence)

Naming Convention

  • The name of a entity is often plural (for example, owners, pets, specialities, and so on).





Discover More
Relational Data Model
(Relation|Table) - Tabular data

A Relation is a logical data structure composed of tuple (row) attribute (column, field) The following data structure are a relation: a table, a materialized view (query) (store data) a query,...
Card Puncher Data Processing
Business - Rule

A business rule is a rule that constrains some aspect of the business process. According to the white paper by the Business Rules Group, a business...
Dataquality Metrics
Data Quality - Entity (Resolution|Disambiguation) - Record (linkage|matching) - Conflation

Entity Resolution, or Record linkage is the process of (joining|matching) records from one data source with another that describe the same Entity. Also known as : entity disambiguation/linking, ...
Utah Teapot
Data Visualization - ( Chart | Graph | Graphic)

A chart, also called a graph (in mathematics for instance), is a graphical representation of data, in which the data is represented by symbols, such as: bars in a bar chart, lines in a line chart,...
Utah Teapot
Data Visualization Foundation

The Utah Teapot. Data visualization is the process of mapping quantitative data to visuals (shapes, color, position, etc) to create a graph made of geometric object. Information visualization: ...
Scd Type2
Dimensional Data Modeling - Slowly Changing Dimensions (SCD)

Tracking changes in dimension is referred as slowly changing dimensions. It contains data history In the source system a lot of changes are daily made : new customers are added, addresses are modified,...
Ebs Flexfield
EBS - Flexfield

A flexfield is a field made up of sub-fields, or segments. A flexfield appears on your form as a pop-up window that contains a prompt for each segment. Each segment has: a name and a set of valid...
Identity

An identity is the identifier of an entity. (id + entity = identity) The key is a mathematical entity, while the owner – and the connection between owner and key – are not. identityidentifier (key)...
Card Puncher Data Processing
Language - (Program) Entity - Construct

A entity: have a name have a scope is created by a declaration statement is stored in a variable have a value The scope and name of the entity are theidentity of the entity. See See also:...
Card Puncher Data Processing
Language - Object

Objects are values that hold other values at specific named locations called properties (Property in this context may be a method or a variable) An object can be seen as a single data structure that contains:...



Share this page:
Follow us:
Task Runner