Data Warehouse Developer Questions And Answers

21⟩ Please explain what is conformed fact?

☛ Conformed dimensions are the dimensions which can be used across multiple Data Marts in combination with multiple facts tables accordingly.

☛ A conformed dimension is a dimension that has exactly the same meaning and content when being referred from different fact tables. A conformed dimension can refer to multiple tables in multiple data marts within the same organization.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

207 views

22⟩ Tell us what are the different types of SCD's used in data warehousing?

SCD (Slowly changing dimensions), are the dimensions in which the data changes slowly, rather than changing regularly on a time basis.

Three types of SCDs are used in data warehousing, which are defined as:

☛ – SCD1: It is a record that is used to replace the original record even there is only one record existing in the database. The current data will be replaced and the new data will take its place.

☛ – SCD2: It is the new record file that is added to the dimension table. This record exists in the database with the current data and previous data that is stored in the history.

☛ – SCD3: This uses the original data that is modified to the new data. This consists of two records: one record that exist in the database and another record that will replace the old database record with the new information.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

206 views

23⟩ Tell us what is level of Granularity of a fact table?

A fact table is usually designed at a low level of Granularity. This means that we need to find the lowest level of information that can store in a fact table.

e.g.Employee performance is a very high level of granularity. Employee_performance_daily, employee_perfomance_weekly can be considered lower levels of granularity.

The granularity is the lowest level of information stored in the fact table. The depth of data level is known as granularity. In date dimension, the level could be year, month, quarter, period, week, day of granularity.

The process consists of the following two steps:

☛ Determining the dimensions that are to be included

☛ Determining the location to locate the hierarchy of each dimension of information. The above factors of determination will be resent to the requirements.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

207 views

24⟩ Tell us how many dimensions are selected in Slice operation?

Only one dimension is selected for the slice operation.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

203 views

25⟩ Tell us what language is the base of DMQL?

DMQL is based on Structured Query Language (SQL).

Is this answer helpful? 0 Yes | 0 No

Answer This Question

200 views

26⟩ Do you know what is Virtual Data Warehousing?

☛ A virtual data warehouse provides a collective view of the completed data. A virtual data warehouse has no historic data. It can be considered as a logical data model of the containing metadata.

☛ Virtual data warehousing is a ‘de facto’ information system strategy for supporting analytical decision making. It is one of the best ways for translating raw data and presenting it in the form that can be used by decision makers. It provides semantic map – which allows the end user for viewing as virtualized.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

195 views

27⟩ Do you know what is a warehouse manager?

Warehouse manager is responsible for the warehouse management process. The warehouse manager consist of third party system software, C programs and shell scripts. The size and complexity of warehouse manager varies between specific solutions.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

196 views

28⟩ Can you explain me how does a Data Cube help?

Data cube helps us to represent the data in multiple dimensions. The data cube is defined by dimensions and facts.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

206 views

29⟩ Tell us what is the benefit of normalization?

Normalization helps in reducing data redundancy.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

186 views

30⟩ Explain me which language is used for defining Schema Definition?

Data Mining Query Language (DMQL) is used for Schema Definition.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

188 views

31⟩ What is the functions performed by OLAP?

OLAP performs functions such as roll-up, drill-down, slice, dice, and pivot.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

182 views

32⟩ Explain me how many dimensions are selected in dice operation?

For dice operation two or more dimensions are selected for a given cube.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

186 views

33⟩ Can you tell me which one is faster, Multidimensional OLAP or Relational OLAP?

Multidimensional OLAP is faster than Relational OLAP.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

194 views

34⟩ Explain me what is ODS?

☛ An operational data store (“ODS”) is a database designed to integrate data from multiple sources for additional operations on the data. Unlike a master data store, the data is not sent back to operational systems. It may be passed for further operations and to the data warehouse for reporting.

☛ In ODS, data can be scrubbed, resolved for redundancy and checked for compliance with the corresponding business rules. This data store can be used for integrating disparate data from multiple sources so that business operations, analysis and reporting can be carried while business operations occur. This is the place where most of the data used in current operation is housed before it’s transferred to the data warehouse for longer term storage or archiving.

☛ An ODS is designed for relatively simple queries on small amounts of data (such as finding the status of a customer order), rather than the complex queries on large amounts of data typical of the data warehouse.

☛ An ODS is similar to your short term memory where it only stores very recent information. On the contrary, the data warehouse is more like long term memory storing relatively permanent information.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

186 views

35⟩ Explain me which one is faster, Multidimensional OLAP or Relational OLAP?

☛ Multidimensional OLAP is faster than Relational OLAP.

☛ MOLAP: Multi-dimensional OLAP

☛ Data is stored in a multidimensional cube. The storage is not in the relational database, but in proprietary formats (one example is PowerOLAP’s .olp file). MOLAP products can be compatible with Excel, which can make data interactions easy to learn.

☛ ROLAP: Relational OLAP

☛ ROLAP products access a relational database by using SQL (structured query language), which is the standard language that is used to define and manipulate data in an RDBMS. Subsequent processing may occur in the RDBMS or within a mid-tier server, which accepts requests from clients, translates them into SQL statements, and passes them on to the RDBMS.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

196 views

36⟩ Tell us what do you mean by Data Extraction?

Data extraction means gathering data from multiple heterogeneous sources.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

206 views

37⟩ Do you know what is Data Warehousing?

Data Warehousing is the process of constructing and using the data warehouse.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

192 views

38⟩ Tell us what is the difference between agglomerative and divisive Hierarchical Clustering?

☛ Agglomerative Hierarchical clustering method allows the clusters to be read from bottom to top so that the program always reads from the sub-component first then moves to the parent whereas Divisive Hierarchical clustering uses top-bottom approach in which the parent is visited first than the child.

☛ Agglomerative hierarchical method consists of objects in which each object creates its own clusters and these clusters are grouped together to create a large cluster. It defines a process of continuous merging until all the single clusters are merged together into a complete big cluster that will consist of all the objects of child clusters. However, in divisive clustering, the parent cluster is divided into smaller cluster and it keeps on dividing until each cluster has a single object to represent.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

241 views

39⟩ Tell us how many fact tables are there in a star schema?

There is only one fact table in a star Schema.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

189 views

40⟩ Explain me what is active data warehousing?

☛ An active data warehouse represents a single state of the business. Active data warehousing considers the analytic perspectives of customers and SUPPLIERS. It helps to deliver the updated data through reports.

☛ A form of repository of captured transactional data is known as ‘active data warehousing’. Using this concept, trends and patterns are found to be used for future decision making. Active data warehouse has a feature which can integrate the changes of data while scheduled cycles refresh. Enterprises utilize an active data warehouse in drawing the company’s image in statistical manner.

Is this answer helpful? 0 Yes | 0 No

Answer This Question

180 views

Data Warehouse Developer

Home Data Warehouse Data Warehouse Developer

55 Data Warehouse Developer Questions And Answers

21⟩ Please explain what is conformed fact?

22⟩ Tell us what are the different types of SCD's used in data warehousing?

23⟩ Tell us what is level of Granularity of a fact table?

24⟩ Tell us how many dimensions are selected in Slice operation?

25⟩ Tell us what language is the base of DMQL?

26⟩ Do you know what is Virtual Data Warehousing?

27⟩ Do you know what is a warehouse manager?

28⟩ Can you explain me how does a Data Cube help?

29⟩ Tell us what is the benefit of normalization?

30⟩ Explain me which language is used for defining Schema Definition?

31⟩ What is the functions performed by OLAP?

32⟩ Explain me how many dimensions are selected in dice operation?

33⟩ Can you tell me which one is faster, Multidimensional OLAP or Relational OLAP?

34⟩ Explain me what is ODS?

35⟩ Explain me which one is faster, Multidimensional OLAP or Relational OLAP?

36⟩ Tell us what do you mean by Data Extraction?

37⟩ Do you know what is Data Warehousing?

38⟩ Tell us what is the difference between agglomerative and divisive Hierarchical Clustering?

39⟩ Tell us how many fact tables are there in a star schema?

40⟩ Explain me what is active data warehousing?

Quick Links:

Data Warehouse Developer

Home Data Warehouse Data Warehouse Developer

55 Data Warehouse Developer Questions And Answers

21⟩ Please explain what is conformed fact?

22⟩ Tell us what are the different types of SCD's used in data warehousing?

23⟩ Tell us what is level of Granularity of a fact table?

24⟩ Tell us how many dimensions are selected in Slice operation?

25⟩ Tell us what language is the base of DMQL?

26⟩ Do you know what is Virtual Data Warehousing?

27⟩ Do you know what is a warehouse manager?

28⟩ Can you explain me how does a Data Cube help?

29⟩ Tell us what is the benefit of normalization?

30⟩ Explain me which language is used for defining Schema Definition?

31⟩ What is the functions performed by OLAP?

32⟩ Explain me how many dimensions are selected in dice operation?

33⟩ Can you tell me which one is faster, Multidimensional OLAP or Relational OLAP?

34⟩ Explain me what is ODS?

35⟩ Explain me which one is faster, Multidimensional OLAP or Relational OLAP?

36⟩ Tell us what do you mean by Data Extraction?

37⟩ Do you know what is Data Warehousing?

38⟩ Tell us what is the difference between agglomerative and divisive Hierarchical Clustering?

39⟩ Tell us how many fact tables are there in a star schema?

40⟩ Explain me what is active data warehousing?

BE THE FIRST TO KNOW

Quick Links: