Infogoal Logo
GOAL DIRECTED LEARNING
Master DW

Data and Analytics Tutorial

Data and Analytics Overview
Under Construction

Data and Analytics Success

Data and Analytics Strategy
Project Management
Data Analytics Methodology
Quick Wins
Data Science Methodology

Requirements

BI Requirements Workshop

Architecture and Design

Architecture Patterns
Technical Architecture
Data Attributes
Data Modeling Basics
Dimensional Data Models

Enterprise Information Management

Data Governance
Metadata
Data Quality

Data Stores and Structures

Data Sources
Database Choices
Big Data
Atomic Warehouse
Dimensional Warehouse
Logical Data Warehouse
Data Lake
Operational Datastore (ODS)
Data Vault
Data Science Sandbox
Flat Files Data
Graph Databases
Time Series Data

Data Integration

Data Pipeline
Change Data Capture
Extract Transform Load
ETL Tool Selection
Data Warehoouse Automation
Data Wrangling
Data Science Workflow

BI and Data Visualization

BI - Business Intelligence
Data Viaulization

Data Science

Statistics
Descriptive Analytics
Predictive Analytics
Prescriptive Analytics

Test and Deploy

Testing
Security Architecture
Desaster Recovery
Rollout
Sustaining DW/BI

Database Technology for Data and Analytics

Select and Use the Right Database Technology for Your Data Warehousing and Business Intelligence Project

Which relational database should you choose for the staging, data warehouse and data mart databases that make up the data warehousing system? In addition, which version and feature set are the best choice? As you can see by this data warehouse architecture diagram there are a number of databases involved:

Data Warehousing Technical Architecture

First look at your goals and objectives. Requirements you may consider when evaluating include:

Performance features to consider:

Financial Considerations:

There are a number of databases that you may want to consider. The following databases and their respective vendors are market leaders that you are likely to evaluate:

There are other very good database systems such as Sybase-IQ and Postgres.

Data and Analytics with DB2

DB2 from IBM is definitely an alternative to consider if your organization runs an IBM shop with AIX or z/OS operating system.

Strengths

Weaknesses

  • Strong on IBM platforms (AIX and Z/OS)
  • Highly scalable
  • Integrates with related IBM offerings
  • Supported by IBM and partners
  • Good installed base
  • Not as strong on Linux and Windows
  • Add on cost is high

 


DB2 has a number of versions and supports multiple platforms. The Express-C version of the system differs from other express versions in that there is no limit on database size. The Data Warehouse Edition (DWE) packages DB2 Enterprise Server along with:

IBM has developed a means of speeding queries by storing pre-computing values called Materialized Query Tables (MQT).  This helps DB2 to be highly scalable.

Data and Analytics with MySql

MySQL from Sun Software is the world's most popular open source DBMS. It has proven itself in many situations with successes with companies like LinkedIn, Dow Corning and the NBC Olympics.

Strengths

Weaknesses

  • Many users
  • Low cost
  • Scalable
  • Multi-platform
  • Many partners
  • Many tools
  • Easy to use
  • Supported by Sun Software
  • Features may be slow in coming
  • OLAP is not provided by Sun

 


With MySQL, you can start with the Community Edition and then upgrade to the Enterprise Edition for production work.

Business Intelligence with Oracle

Oracle provides an industry leading relational database plus a suite of business intelligence products.

Strengths

Weaknesses

  • Data warehousing and business intelligence features available
  • Very scalable
  • Multi-platform
  • Many partners
  • Many tools
  • Supported by Oracle
  • High price
  • Add-ons cost extra

 

Oracle does provide a free Express version, so it is possible to start at a low cost.

Data and Analytics with SQL Server

SQL Server 2005 from Microsoft is a great choice when the environment is Windows and moderate scalability is required.

Strengths

Weaknesses

  • Ease of use
  • High customer satisfaction
  • Strong Microsoft supprt
  • Widely used
  • Availability of people
  • Scales up to 3 TB
  • ETL suite included
  • OLAP suite included
  • Reporting suite included
  • Low TCO compared to other commercial software
  • Limited to Windows platform
  • Scalability beyond 3 TB is a challenge

 

SQL Server 2019 has a number of versions ranging in price from free to over $25,000. If you want to learn SQL Server or do a proof of concept, the Developer version is a bargain. Keep in mind that if multiple platforms such as development, test and production are needed the cost will increase.

Express

Developer

Work Group

Standard

Enterprise
CPU 1 1 2 4 No Limit
RAM 1 GB 3 GB 3 GB No Limit No Limit

Database
Size

4 GB 4 GB

No
Limit

No Limit No Limit
Advanced
Features
No No No No Yes
UI Basic Studio Studio Studio Studio
OLAP No Yes No Yes Yes
Reporting No Yes No Yes Yes
Approx Price Free $50 $3.9K $6K $25K

Please check with vendor for most current prices and features. The prices and features presented on this page are for information only.

Data and Analytics with Teradata

Teradata has a long history of supporting large scale databases aimed at data warehousing and business intelligence.  Teradata provides both the relational database software and the hardware that supports it.

Strengths

Weaknesses

  • Can support large data warehouses - over 10 TB
  • Industry data models
  • High price point
  • Not portable - requires specialized hardware

 

Consider Teradata when large databases that support many users are needed.

Database Choice for Data and Analytics

In conclusion, there a number of relational database choices available to support data warehousing and business intelligence.  These database choices cover a wide range of scalability and price.

 

 


Advertisements

Advertisements:
 


Infogoal.com is organized to help you gain mastery.
Examples may be simplified to facilitate learning.
Content is reviewed for errors but is not warranted to be 100% correct.
In order to use this site, you must read and agree to the terms of use, privacy policy and cookie policy.
Copyright 2006-2020 by Infogoal, LLC. All Rights Reserved.

Infogoal Logo