Tuesday, July 17, 2012

Extract, transform and load (ETL)

Extract, transform and load (ETL) is a process in database usage and especially in data warehousing that involves:
Extracting data from outside sources
Transforming it to fit operational needs (which can include quality levels)
Loading it into the end target (database or data warehouse)

http://en.wikipedia.org/wiki/Extract,_transform,_load
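
The three steps above can be sketched in a few lines of Python. This is only a minimal illustration, not how any particular ETL tool works: the CSV content, the `sales` table, and the quality rule (skip rows whose amount is not numeric) are all made up for the example, and an in-memory SQLite database stands in for the end target.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from an outside source (values are illustrative).
RAW_CSV = """customer,amount,country
 alice ,10.50,us
BOB,n/a,de
carol,7.25,US
"""

def extract(text):
    """Extract: read rows from the outside source."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: normalize values and drop rows that fail a quality check."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # assumed quality rule: skip rows with a non-numeric amount
        clean.append((row["customer"].strip().title(), amount, row["country"].strip().upper()))
    return clean

def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL, country TEXT)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()

conn = sqlite3.connect(":memory:")  # stand-in for the database or data warehouse
load(transform(extract(RAW_CSV)), conn)
loaded = conn.execute("SELECT customer, amount, country FROM sales ORDER BY customer").fetchall()
```

Keeping each step in its own function mirrors the definition: the source, the cleaning rules, and the target can each be swapped without touching the other two.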


  • SSIS (SQL Server Integration Services), Microsoft

Microsoft Integration Services is a platform for building high-performance data integration solutions, including extraction, transformation, and load (ETL) packages for data warehousing.

Integration Services includes graphical tools and wizards for building and debugging packages; tasks for performing workflow functions such as FTP operations, executing SQL statements, and sending e-mail messages; data sources and destinations for extracting and loading data; transformations for cleaning, aggregating, merging, and copying data; and a management service.
http://msdn.microsoft.com/en-us/library/ms169917(v=sql.105).aspx

Ad-Hoc Query


"Ad hoc" reporting systems allow the users themselves to create specific, customized queries.
Typically this is done through a user-friendly GUI-based system, without needing the in-depth knowledge of SQL or of the database schema that a programmer would have.
Because such reporting has the potential to severely degrade the performance of a live system, it is usually provided over a data warehouse.
Ad hoc querying/reporting is a business intelligence subtopic, along with OLAP, data warehousing, data mining and other tools.
http://en.wikipedia.org/wiki/Ad_hoc

An Ad-Hoc Query is a query that cannot be determined prior to the moment the query is issued.
It is created to get information when the need arises, and it consists of dynamically constructed SQL, which is usually built by desktop-resident query tools.
http://www.learn.geekinterview.com/data-warehouse/dw-basics/what-is-an-ad-hoc-query.html
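
To make "dynamically constructed SQL" concrete, here is a small sketch of what a query tool might do behind its GUI. The `sales` table, its columns, and the `build_ad_hoc_query` helper are hypothetical names chosen for this example. The statement is assembled only when the user picks filters, column names are checked against an allow-list, and values are passed as parameters rather than pasted into the SQL text.

```python
import sqlite3

# Columns a user may filter on; anything else is rejected (hypothetical schema).
ALLOWED_COLUMNS = {"region", "product", "year"}

def build_ad_hoc_query(filters):
    """Build a SELECT at request time from whatever filters the user picked."""
    clauses, params = [], []
    for column, value in sorted(filters.items()):
        if column not in ALLOWED_COLUMNS:
            raise ValueError(f"unknown column: {column}")
        clauses.append(f"{column} = ?")  # the value itself is bound as a parameter
        params.append(value)
    sql = "SELECT region, product, year, revenue FROM sales"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return sql, params

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, product TEXT, year INTEGER, revenue REAL)")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", [
    ("EU", "widget", 2011, 120.0),
    ("EU", "gadget", 2012, 80.0),
    ("US", "widget", 2012, 200.0),
])

# The query text is not known until the user chooses these filters.
sql, params = build_ad_hoc_query({"year": 2012, "region": "EU"})
rows = conn.execute(sql, params).fetchall()
```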

ERP, SCM, CRM


Businesses look to their Information Technology (IT) departments for solutions to achieve greater efficiency and for business software to improve customer service.

Business managers are always looking to compare the cost of IT solutions with the potential return on investment (ROI).


  • ERP, the Complete Business Solution

The biggest decision, and the riskiest solution, is an Enterprise Resource Planning system, or ERP.
This solution can completely replace all of the business software and procedures within the company with a single suite of programs specifically designed to manage assets and resources efficiently, built around a comprehensive financial system.
The benefit of ERP is having a single solution provider for everything from accounts through manufacturing and warehouse management to human resources.
Every package is linked to allow extensive business intelligence.




  • SCM for Efficient Management of the Supply Chain

Supply Chain Management is defined as the "design, planning, execution, control, and monitoring of supply chain activities with the objective of creating net value, building a competitive infrastructure, leveraging worldwide logistics, synchronizing supply with demand, and measuring performance globally".



  • CRM, Customer Relationship Management

A CRM software solution provides a database where every piece of communication, such as a telephone call, email, or direct conversation, can be logged to ensure that promises are met and leads are followed up efficiently.
This is especially important for a business that has many customers, or one with a team of sales staff who may need to know the details of the last conversation.

http://suite101.com/article/erp-scm-crm-business-software-solutions-a275320

  • A CRM Without Boundaries

Vtiger CRM is promoted by its vendor as the most powerful open source CRM solution.
https://www.vtiger.com/open-source/

Data Warehouse Overview


-periodically updated, relatively static
-separated from operational databases

Tools for building and maintaining a data warehouse

-DBMS tool: Oracle, Teradata
-middleware tool: any ETL tool, such as Informatica
-metadata tool: Teradata metadata repository
-warehouse administration tool: any database administration tool, such as Oracle Enterprise Manager
-OLAP & query tools: MicroStrategy, Cognos


  • data warehouse
-a storage area for processed and integrated data from different sources
-holds both operational and external data

-allows users to extract data for business analysis and strategic decision making
-supports the management decision-making process
-stand-alone repository of information that is integrated from several operational databases
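
As a rough sketch of the "stand-alone repository integrated from several operational databases" idea, the example below uses three separate in-memory SQLite databases. Everything in it is an assumption made for illustration: the two operational schemas (`orders`, `tickets`), the integrated `customer_facts` table, and the `refresh` function that stands in for a periodic update.

```python
import sqlite3

# Two hypothetical operational databases, each with its own schema.
orders_db = sqlite3.connect(":memory:")
orders_db.execute("CREATE TABLE orders (customer TEXT, total REAL)")
orders_db.executemany("INSERT INTO orders VALUES (?, ?)", [("acme", 100.0), ("globex", 40.0)])

support_db = sqlite3.connect(":memory:")
support_db.execute("CREATE TABLE tickets (client TEXT, hours REAL)")
support_db.executemany("INSERT INTO tickets VALUES (?, ?)", [("acme", 3.0), ("acme", 1.5)])

# The warehouse is a separate store with one integrated table.
warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE customer_facts (customer TEXT, source TEXT, measure TEXT, value REAL)")

def refresh(warehouse):
    """Periodic refresh: rebuild the integrated table from the operational sources."""
    warehouse.execute("DELETE FROM customer_facts")
    for customer, total in orders_db.execute("SELECT customer, total FROM orders"):
        warehouse.execute("INSERT INTO customer_facts VALUES (?, 'orders', 'revenue', ?)", (customer, total))
    for client, hours in support_db.execute("SELECT client, hours FROM tickets"):
        warehouse.execute("INSERT INTO customer_facts VALUES (?, 'support', 'support_hours', ?)", (client, hours))
    warehouse.commit()

refresh(warehouse)

# Analysis runs against the warehouse only, never against the live systems.
summary = warehouse.execute(
    "SELECT customer, measure, SUM(value) FROM customer_facts "
    "GROUP BY customer, measure ORDER BY customer, measure"
).fetchall()
```

Between refreshes the warehouse stays unchanged, which matches the "periodically updated, relatively static" description above.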

  • DATA WAREHOUSING Basics