{"id":583,"date":"2023-03-16T13:00:43","date_gmt":"2023-03-16T13:00:43","guid":{"rendered":"https:\/\/pc-keeper.tech\/index.php\/2023\/03\/16\/navigate-etl-testing-align-data\/"},"modified":"2023-03-16T13:00:43","modified_gmt":"2023-03-16T13:00:43","slug":"navigate-etl-testing-align-data","status":"publish","type":"post","link":"https:\/\/pc-keeper.tech\/index.php\/2023\/03\/16\/navigate-etl-testing-align-data\/","title":{"rendered":"Navigate ETL Testing &#038; Align Data"},"content":{"rendered":"<div>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-337318 img-responsive alignright\" src=\"https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133019\/How-to-extract-and-transform-data-with-ETL-testing.jpg\" alt=\"How to extract and transform data with ETL testing\" width=\"250\" height=\"250\" srcset=\"https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133019\/How-to-extract-and-transform-data-with-ETL-testing.jpg 250w, https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133019\/How-to-extract-and-transform-data-with-ETL-testing-150x150.jpg 150w, https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133019\/How-to-extract-and-transform-data-with-ETL-testing-100x100.jpg 100w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\"\/>This article will cover the main aspects of ETL testing you might need to know.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">What is ETL testing?<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">ETL stands for Extract-Transform-Load. 
ETL testing ensures that data loaded from the source to the destination after business transformation is accurate. Tools such as Apache Hive and its components are available to support this.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Data is extracted from an online transaction processing (OLTP) database and often from non-OLTP sources such as spreadsheets or text files. It is then transformed to match the data types used in the data warehouse and loaded into the data warehouse database.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">An example of ETL<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<figure id=\"attachment_337319\" aria-describedby=\"caption-attachment-337319\" style=\"width: 300px\" class=\"wp-caption alignright\"><img decoding=\"async\" loading=\"lazy\" class=\"size-medium wp-image-337319 img-responsive\" src=\"https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133543\/ETL-Testing-300x172.png\" alt=\"diagram of ETL Testing\" width=\"300\" height=\"172\" srcset=\"https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133543\/ETL-Testing-300x172.png 300w, https:\/\/ieeecs-media.computer.org\/wp-media\/2023\/03\/13133543\/ETL-Testing.png 512w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\"\/><figcaption id=\"caption-attachment-337319\" class=\"wp-caption-text\">Source<\/figcaption><\/figure>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">If you have a charitable organization with different departments such as fundraising, marketing, and accounting, each department will handle volunteer and donor information in its own way and store that data differently. 
This is even more likely with hybrid working scenarios.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">If the departments had to manually cross-reference every individual across their systems, it would take a long time. A data warehouse that uses ETL to store data from different sources in a single, unified structure massively speeds up this process.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">What does ETL testing look for?<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">ETL testing verifies whether, for example, your inventory forecasting data has been moved as expected. It checks whether the record counts in the source and the target match, and whether the transformed data matches expectations.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">It also checks that primary key and foreign key relations have been preserved during the process, and looks for any duplicates in the loaded data.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">ETL testing method<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">The first part of ETL testing consists of preparation: designing the most appropriate test strategy based on the ETL process logic and transformation rules. The test strategy is a document listing the test length, the reasons and methods for testing, and the tools and people needed. 
This strategy is then organized into test cases: numbered instructions for running the test.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Testers need to look at the ETL process documentation, such as technical specifications, mapping specs, and business requirements, to decide which data should be integrated, including its sources, formats, and destinations.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Testers also need to ascertain the transformation rules for transforming and cleaning the data, and where and how the data will be loaded. Then test data is generated. Testers need real-time synthetic data to run the tests against.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">After this, test cases and scripts are created. These test cases describe what is to be checked and the method for testing. ETL test cases are normally written in SQL. 
They also tend to have two queries: one that extracts data from its source, and another that extracts it from the target storage.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">Three types of ETL tests<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">There are essentially three types of ETL tests, which we will describe here.<\/p>\n<p>\u00a0<\/p>\n<h3 style=\"color: #002855; font-size: 20px; font-family: Montserrat; font-weight: 500; line-height: 24px;\">ETL performance tests<\/h3>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">During data integration testing, the entire application is tested in an environment that imitates production. Various components, such as interfaces and databases, are all tested against the data. 
This process allows you to find any bottlenecks and ensure that the process is ready to scale up with the increasing amount of data.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">This method tests whether data loads and queries are carried out within the expected timeframe, and measures load times for various data volumes.<\/p>\n<p>\u00a0<\/p>\n<h3 style=\"color: #002855; font-size: 20px; font-family: Montserrat; font-weight: 500; line-height: 24px;\">Data quality tests<\/h3>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">This test confirms that data has been extracted properly, without any losses or duplicates.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Data is checked for duplicates to make sure there are no rows with the same unique key column or column combination. Data is validated in accordance with validation rules to check for any incorrect dates, input errors, or incorrect symbols that could render the data invalid.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Metadata testing checks that definitions conform to the data model and application design, and reference testing checks data against required attributes.<\/p>\n<p>\u00a0<\/p>\n<h3 style=\"color: #002855; font-size: 20px; font-family: Montserrat; font-weight: 500; line-height: 24px;\">Data transformation success tests<\/h3>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Data transformation testing is used to check that data is transformed as required. 
The two approaches to this are white box testing and black box testing.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">White box transformation testing entails testing application code. A tester reviews the source-to-target transformation logic in the documentation, then implements this logic in code and compares its output with the transformed data.<\/p>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Black box transformation testing allows you to check how the system functions. The tester prepares data to mirror various transformation scenarios listed in the documentation. You don\u2019t need coding for this method.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">To sum up\u2026<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\">Whether you\u2019re in the business of manufacturing ERP software, running a small coffee business, or anything in between, ETL testing can help you to align your data in a single, unified data warehouse.<\/p>\n<p>\u00a0<\/p>\n<h2 style=\"color: #002855; font-size: 24px; font-family: Montserrat; font-weight: 500; line-height: 29px;\">About the Writer<\/h2>\n<hr style=\"text-align: left; width: 30%; height: 3px; color: #ffa300; background-color: #ffa300; border: none;\"\/>\n<p style=\"color: #454545; font-size: 18px; font-family: Open Sans; font-weight: 400; line-height: 1.7em;\"><img decoding=\"async\" loading=\"lazy\" class=\"img-responsive alignleft wp-image-283798 size-thumbnail\" src=\"https:\/\/ieeecs-media.computer.org\/wp-media\/2022\/06\/22000948\/pohan-lin-headshot-150x150.jpg\" alt=\"Pohan Lin\" width=\"150\" height=\"150\" 
srcset=\"https:\/\/ieeecs-media.computer.org\/wp-media\/2022\/06\/22000948\/pohan-lin-headshot-150x150.jpg 150w, https:\/\/ieeecs-media.computer.org\/wp-media\/2022\/06\/22000948\/pohan-lin-headshot-300x300.jpg 300w, https:\/\/ieeecs-media.computer.org\/wp-media\/2022\/06\/22000948\/pohan-lin-headshot-100x100.jpg 100w, https:\/\/ieeecs-media.computer.org\/wp-media\/2022\/06\/22000948\/pohan-lin-headshot.jpg 400w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\"\/>Pohan Lin is the Senior Web Marketing and Localizations Manager at Databricks, a global Data and AI provider connecting the features of data warehouses and data lakes to create lakehouse architecture. With over 18 years of experience in web marketing, online SaaS business, and e-commerce growth, Pohan is passionate about innovation and is dedicated to communicating the significant impact data has in marketing. Pohan Lin has also published articles for domains such as Landbot.<\/p>\n<p>\u00a0<\/p>\n<div style=\"background-color: #d4f1f4; padding: 15px 15px 10px 15px;\">\n<p style=\"color: #454545; font-size: 18px; line-height: 1.7em;\"><strong>Disclaimer:<\/strong> The author is completely responsible for the content of this article. The opinions expressed are their own and do not represent IEEE\u2019s position nor that of the Computer Society nor its Leadership.<\/p>\n<\/div><\/div>\n<p><a href=\"https:\/\/www.computer.org\/publications\/tech-news\/trends\/etl-testing\/\">Source link<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This article will cover the main aspects of ETL testing you might need to know. 
\u00a0 What is ETL&hellip;<\/p>\n","protected":false},"author":1,"featured_media":584,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"class_list":["post-583","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech-news-post"],"_links":{"self":[{"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/posts\/583","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/comments?post=583"}],"version-history":[{"count":0,"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/posts\/583\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/media\/584"}],"wp:attachment":[{"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/media?parent=583"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/categories?post=583"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/pc-keeper.tech\/index.php\/wp-json\/wp\/v2\/tags?post=583"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}