PENTAHO PROJECT

Đã Đóng Đã đăng vào 3 năm trước Thanh toán khi bàn giao
Đã Đóng Thanh toán khi bàn giao

Part A

• You are required to discuss and choose a data warehouse domain as listed below:

i) Manufacturing

ii) Plantation

iii) Technology

iv) Health

• Write business description on the chosen domain and from there list all the business rules.

• Based on the business rules construct ER-D.

Part B

• Design the star schema / snowflake / starflakes schema for the previous business rules and ER-D that you have constructed, ensuring your Dimensions are conformed, primary and foreign keys are clearly labelled, and that your attributes are named using verbose textual descriptions.

• Explain the schema and justify why you choose the above schema.

Part C

• Based on the schema in Part B, create the raw data in CSV files. Using Pentaho data integration software, implement your dimensional model as well as the ETL process using various steps which are suitable.

• To store the data, you will use MySQL.

• For each table, you are expected to have at least 50 records.

Part D

• Create 10 queries to answer various business questions from your data warehouse.

The examples of the questions are as the following: (Please note that this is only example, your actual questions should suit with your data warehouse)

• You are expected to use some OLAP functions in your queries.

1. Which record/group/artist has given the record label the highest/lowest gain/return on investment per week/month/year.

2. What seller/city/country has the highest number of demands that could not be met?

3. Which record/group/artist has the highest number of demands?

MySQL

ID dự án: #29651282

Về dự án

Dự án từ xa 2 năm trước đang mở