Data Warehousing Questions And Answers

21⟩ What is the difference between star and snowflake schemas?

Star schema:

A single fact table with N number of DimensionSnowflake schema: Any dimensions with extended dimensions are known as snowflake schema.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

145 views

22⟩ Can a dimension table contain numeric values?

Yes. However, those data type will be char (only the values can numeric/char).Yes, dimensions even contain numerical because these are descriptive elements of our business.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

121 views

23⟩ How many clustered indexes can u create for a table in DWH? In case of truncate and delete command what happens to table, which has unique id.

You can have only one clustered index per table. If you use delete command, you can rollback... it fills your redo log files.

If you do not want records, you may use truncate command, which will be faster and does not fill your redo log file.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

145 views

24⟩ How do you create Surrogate Key using Ab Initio?

There are many ways to create Surrogate key but it depends on your business logic. Here you can try these ways.1. Use next in sequence () function in your transform

2. Use Assign key values component (if your GDE is higher than 1.10)

3. Write a stored proc to this and call this store proc wherever you need.Yes, dimension table contains numeric but not contain measures and facts

Is this answer helpful? 0 Yes | 0 No

Answer This Question

128 views

25⟩ What is hybrid slowly changing dimension?

Hybrid SCDs are combination of both SCD 1 and SCD 2.It may happen that in a table, some columns are important and we need to track changes for them i.e. capture the historical data for them whereas in some columns even if the data changes, we don't care.For such tables we implement Hybrid SCDs, where in some columns are Type 1 and some are Type 2.You can add that it is not an intelligent key but similar to a sequence number and tied to a timestamp typically!

Is this answer helpful? 0 Yes | 0 No

Answer This Question

142 views

26⟩ What is VLDB?

The perception of what constitutes a VLDB continues to grow. A one-terabyte database would normally be considered VLDB.Degenerate dimension: it does not have any link with dimensions and it will not have any attribute.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

118 views

27⟩ What is degenerate dimension table?

Degenerate Dimensions: If a table contains the values, which r neither dimension nor measures is called degenerate dimensions. For example invoice id, employee no.A degenerate dimension is data that is dimensional in nature but stored in a fact table.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

132 views

28⟩ What is Dimensional Modeling?

Dimensional Modeling is a design concept used by many data warehouse designers to build their data warehouse. In this design model all the data is stored in two types of tables - Facts table and Dimension table. Fact table contains the facts/measurements of the business and the dimension table contains the context of measurements i.e., the dimensions on which the facts are calculated.Dimension modeling is a method for designing data warehouse. Three types of modeling are there

1. Conceptual modeling

2. Logical modeling

3. Physical modeling

Is this answer helpful? 0 Yes | 0 No

Answer This Question

142 views

29⟩ What is meant by metadata in context of a Data warehouse and how it is important?

Metadata is the data about data; Business Analyst or data modeler usually capture information about data - the source (where and how the data is originated), nature of data (char, varchar, nullable, existence, valid values etc) and behavior of data (how it is modified / derived and the life cycle) in data dictionary a.k.a metadata.

Metadata is also presented at the Datamart level, subsets, fact and dimensions, ODS etc. For a DW user, metadata provides vital information for analysis / DSS.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

158 views

30⟩ What are the possible data marts in Retail sales?

Product information and sales information

Is this answer helpful? 0 Yes | 0 No

Answer This Question

147 views

31⟩ What are the various ETL tools in the Market?

Various ETL tools used in market are Informatica Data Stage Oracle Warehouse Builder Ab Initio Data Junction

Is this answer helpful? 0 Yes | 0 No

Answer This Question

137 views

32⟩ What is a linked cube?

Linked cube in which a sub-set of the data can be analyzed into detail. The linking ensures that the data in the cubes remain consistent.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

147 views

33⟩ What are the data types present in BO? What happens if we implement view in the designer n report?

Three different data types: Dimensions, Measure, and DetailView is nothing but an alias and it can be used to resolve the loops in the universe.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

160 views

34⟩ What is surrogate key? Where we use it? Explain with examples.

Surrogate key is a substitution for the natural primary key.It is just a unique identifier or number for each row that can be used for the primary key to the table. The only requirement for a surrogate primary key is that it is unique for each row in the table.

Data warehouses typically use a surrogate, (also known as artificial or identity key), key for the dimension tables primary keys. They can use Info sequence generator, or Oracle sequence, or SQL Server Identity values for the surrogate key.

It is useful because the natural primary key (i.e. Customer Number in Customer table) can change and this makes updates more difficult.

Some tables have columns such as AIRPORT_NAME OR CITY_NAME which are stated as the primary keys (according to the business users) but ,not only can these change, indexing on a numerical value is probably better and you could consider creating a surrogate key called, say, AIRPORT_ID. This would be internal to the system and as far as the client is concerned, you may display only the AIRPORT_NAME.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

134 views

35⟩ What are data validation strategies for data mart validation after loading process?

Data validation is to make sure that the loaded data is accurate and meets the business requirements. Strategies are different methods followed to meet the validation requirements.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

155 views

36⟩ What is Data warehousing Hierarchy?

Hierarchies are logical structures that use ordered levels as a means of organizing data. A hierarchy can be used to define data aggregation. For example, in a time dimension, a hierarchy might aggregate data from the month level to the quarter level to the year level. A hierarchy can also be used to define a navigational drill path and to establish a family structure.Within a hierarchy, each level is logically connected to the levels above and below it. Data values at lower levels aggregate into the data values at higher levels. A dimension can be composed of more than one hierarchy. For example, in the product dimension, there might be two hierarchies--one for product categories and one for product suppliers.Dimension hierarchies also group levels from general to granular. Query tools use hierarchies to enable you to drill down into your data to view different levels of granularity. This is one of the key benefits of a data warehouse.When designing hierarchies, you must consider the relationships in business structures. Hierarchies impose a family structure on dimension values. For a particular level value, a value at the next higher level is its parent, and values at the next lower level are its children. These familial relationships enable analysts to access data quickly.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

137 views

37⟩ What is BUS Schema?

BUS Schema is composed of a master suite of confirmed dimension and standardized definition if facts.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

154 views

38⟩ Why fact table is in normal form?

The fact table consists of the Index keys of the dimension/look up tables and the measures. So whenever we have the keys in a table. That it implies that the table is in the normal form.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

154 views

39⟩ What is conformed fact?

Conformed dimensions are the dimensions, which can be used across multiple Data Marts in combination with multiple facts tables accordingly

Is this answer helpful? 0 Yes | 0 No

Answer This Question

182 views

40⟩ What is Difference between E-R Modeling and Dimensional Modeling?

Basic difference is E-R modeling will have logical and physical model. Dimensional model will have only physical model. E-R modeling is used for normalizing the OLTP database design.Dimensional modeling is used for de-normalizing the ROLAP/MOLAP design.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

147 views

Data Warehousing

Home Data Warehouse Data Warehousing

131 Data Warehousing Questions And Answers

21⟩ What is the difference between star and snowflake schemas?

22⟩ Can a dimension table contain numeric values?

23⟩ How many clustered indexes can u create for a table in DWH? In case of truncate and delete command what happens to table, which has unique id.

24⟩ How do you create Surrogate Key using Ab Initio?

25⟩ What is hybrid slowly changing dimension?

26⟩ What is VLDB?

27⟩ What is degenerate dimension table?

28⟩ What is Dimensional Modeling?

29⟩ What is meant by metadata in context of a Data warehouse and how it is important?

30⟩ What are the possible data marts in Retail sales?

31⟩ What are the various ETL tools in the Market?

32⟩ What is a linked cube?

33⟩ What are the data types present in BO? What happens if we implement view in the designer n report?

34⟩ What is surrogate key? Where we use it? Explain with examples.

35⟩ What are data validation strategies for data mart validation after loading process?

36⟩ What is Data warehousing Hierarchy?

37⟩ What is BUS Schema?

38⟩ Why fact table is in normal form?

39⟩ What is conformed fact?

40⟩ What is Difference between E-R Modeling and Dimensional Modeling?

Quick Links:

Data Warehousing

Home Data Warehouse Data Warehousing

131 Data Warehousing Questions And Answers

21⟩ What is the difference between star and snowflake schemas?

22⟩ Can a dimension table contain numeric values?

23⟩ How many clustered indexes can u create for a table in DWH? In case of truncate and delete command what happens to table, which has unique id.

24⟩ How do you create Surrogate Key using Ab Initio?

25⟩ What is hybrid slowly changing dimension?

26⟩ What is VLDB?

27⟩ What is degenerate dimension table?

28⟩ What is Dimensional Modeling?

29⟩ What is meant by metadata in context of a Data warehouse and how it is important?

30⟩ What are the possible data marts in Retail sales?

31⟩ What are the various ETL tools in the Market?

32⟩ What is a linked cube?

33⟩ What are the data types present in BO? What happens if we implement view in the designer n report?

34⟩ What is surrogate key? Where we use it? Explain with examples.

35⟩ What are data validation strategies for data mart validation after loading process?

36⟩ What is Data warehousing Hierarchy?

37⟩ What is BUS Schema?

38⟩ Why fact table is in normal form?

39⟩ What is conformed fact?

40⟩ What is Difference between E-R Modeling and Dimensional Modeling?

BE THE FIRST TO KNOW

Quick Links: