Complete Data Mining (CST466)

Main Purpose: A Database is designed for day-to-day operational transactions (OLTP), while a Data Warehouse is built for complex analysis and reporting (OLAP).
Type of Data: Databases store current, highly detailed, rapidly changing data, whereas Data Warehouses store both current and historical data in a static, summarized format.
Query Complexity: Queries in a database are usually simple and execute instantly, but Data Warehouse queries are highly complex, aggregating large volumes of data for trend analysis.

Mark as Completed

Task 1 (Department to Student): Moving from a summarized view (department averages) down to a highly detailed view (individual student performance) is a Drill-down operation.
Task 2 (Filtering by CSE and 2024): Selecting a specific sub-cube by defining constraints on dimensions (like year=2024 and dept=CSE) is a Slice and Dice operation.
Task 3 (Time-based to Location-based): Rearranging or rotating the data cube axes to view the exact same data from a completely different perspective is a Pivot (Rotate) operation.

Mark as Completed

Star Schema Structure: Consists of a massive central Fact Table connected directly to multiple un-normalized Dimension Tables, forming a distinct star shape.
Star Schema Simplicity: It is the simplest architecture where each dimension is represented by a single table, making database queries very fast and easy to write.
Snowflake Schema Structure: An extension of the star schema where some dimension tables are further normalized and broken down into additional related tables.
Snowflake Appearance: Because dimensions are branched out (e.g., Product -> Category -> Subcategory), the entity-relationship diagram resembles a snowflake.
Normalization vs Space: The Snowflake schema strictly uses normalization to reduce data redundancy, which saves significant storage space compared to the Star schema.
Query Performance: While saving space, Snowflake schemas suffer from slower query performance because fetching data requires more complex "joins" across multiple tables.
Maintenance: Star schemas are much easier to maintain for simpler business models, while Snowflake is better suited for managing highly complex hierarchical dimensions.
Standard Usage: Star schemas are typically deployed in smaller Data Marts, whereas Snowflake is the preferred choice for massive, large-scale enterprise Data Warehouses.

Mark as Completed

Data Mining (CST466) - Complete Master Bank