Workload Optimized Systems: Tuning POWER7 for Analytics

Workload Optimized Systems: Tuning POWER7 for Analytics PDF

Author: Dino Quintero

Publisher: IBM Redbooks

Published: 2013-04-14

Total Pages: 200

ISBN-13: 0738437328

DOWNLOAD EBOOK →

This IBM® Redbooks® publication addresses topics to help clients to take advantage of the virtualization strengths of the POWER® platform to solve system resource utilization challenges and maximize system throughput and capacity. This publication examines the tools, utilities, documentation, and other resources available to help technical teams provide business solutions and support for Cognos® Business Intelligence (BI) and Statistical Package for the Social Sciences (SPSS®) on Power SystemsTM virtualized environments. This book addresses topics to help address complex high availability requirements, help maximize the availability of systems, and provide expert-level documentation to the worldwide support teams. This book strengthens the position of the Cognos and SPSS solutions with a well-defined and documented deployment model within a POWER system virtualized environment. This model provides clients with a planned foundation for security, scaling, capacity, resilience, and optimization. This book is targeted toward technical professionals (BI consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for providing Smart Analytics solutions and support for Cognos and SPSS on Power Systems.

Implementing an Optimized Analytics Solution on IBM Power Systems

Implementing an Optimized Analytics Solution on IBM Power Systems PDF

Author: Dino Quintero

Publisher: IBM Redbooks

Published: 2016-06-01

Total Pages: 294

ISBN-13: 0738441686

DOWNLOAD EBOOK →

This IBM® Redbooks® publication addresses topics to use the virtualization strengths of the IBM POWER8® platform to solve clients' system resource utilization challenges and maximize systems' throughput and capacity. This book addresses performance tuning topics that will help answer clients' complex analytic workload requirements, help maximize systems' resources, and provide expert-level documentation to transfer the how-to-skills to the worldwide teams. This book strengthens the position of IBM Analytics and Big Data solutions with a well-defined and documented deployment model within a POWER8 virtualized environment, offering clients a planned foundation for security, scaling, capacity, resilience, and optimization for analytics workloads. This book is targeted toward technical professionals (analytics consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for providing analytics solutions and support on IBM Power SystemsTM.

Proceedings of International Conference on Intelligent Computing, Information and Control Systems

Proceedings of International Conference on Intelligent Computing, Information and Control Systems PDF

Author: A. Pasumpon Pandian

Publisher: Springer Nature

Published: 2021-01-24

Total Pages: 972

ISBN-13: 9811584435

DOWNLOAD EBOOK →

This book is a collection of papers presented at the International Conference on Intelligent Computing, Information and Control Systems (ICICCS 2020). It encompasses various research works that help to develop and advance the next-generation intelligent computing and control systems. The book integrates the computational intelligence and intelligent control systems to provide a powerful methodology for a wide range of data analytics issues in industries and societal applications. The book also presents the new algorithms and methodologies for promoting advances in common intelligent computing and control methodologies including evolutionary computation, artificial life, virtual infrastructures, fuzzy logic, artificial immune systems, neural networks and various neuro-hybrid methodologies. This book is pragmatic for researchers, academicians and students dealing with mathematically intransigent problems.

The Effect of Information Technology on Business and Marketing Intelligence Systems

The Effect of Information Technology on Business and Marketing Intelligence Systems PDF

Author: Muhammad Alshurideh

Publisher: Springer Nature

Published: 2023-03-12

Total Pages: 2536

ISBN-13: 3031123824

DOWNLOAD EBOOK →

Business shapes have been changed these days. Change is the main dominant fact that change the way of business operations running. Topics such as innovation, entrepreneurship, leadership, blockchain, mobile business, social media, e-learning, machine learning, and artificial intelligence become essential to be considered by each institution within the technology era. This book tries to give additional views on how technologies influence business and marketing operations for insuring successful institutions survival. The world needs to develop management and intelligent business scenario plans that suite a variety of crisis appears these days. Also, business and marketing intelligence should meet government priorities in individual countries and minimise the risk of business disruptions. Business intelligence - the strategies and technology companies that use it to collect, interpret, and benefit from data - play a key role in informing company strategies, functions, and efficiency. However, being essential to the success, many companies are not taking advantage of tools that can improve their business intelligence efforts. Information technology become a core stone in business. For example, the combination of machine learning and business intelligence can have a far-reaching impact on the insights the company gets from its available data to improve productivity, quality, customer service and more. This book is important because it introduces a large number of chapters that discussed the implications of different Information technology applications in business. This book contains a set of volumes which are: 1- Social Marketing and Social Media Applications, 2- Social Marketing and Social Media Applications, 3- Business and Data Analytics, 4- Corporate governance and performance, 5- Innovation, Entrepreneurship and leadership, 6- Knowledge management, 7- Machine learning, IOT, BIG DATA, Block Chain and AI, 8- Marketing Mix, Services and Branding.

IBM Technical Computing Clouds

IBM Technical Computing Clouds PDF

Author: Dino Quintero

Publisher: IBM Redbooks

Published: 2013-10-28

Total Pages: 266

ISBN-13: 0738438782

DOWNLOAD EBOOK →

This IBM® Redbooks® publication highlights IBM Technical Computing as a flexible infrastructure for clients looking to reduce capital and operational expenditures, optimize energy usage, or re-use the infrastructure. This book strengthens IBM SmartCloud® solutions, in particular IBM Technical Computing clouds, with a well-defined and documented deployment model within an IBM System x® or an IBM Flex SystemTM. This provides clients with a cost-effective, highly scalable, robust solution with a planned foundation for scaling, capacity, resilience, optimization, automation, and monitoring. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) responsible for providing cloud-computing solutions and support.

Tuning and Optimization of Resource Management for Data Analytics Applications

Tuning and Optimization of Resource Management for Data Analytics Applications PDF

Author: Md Muhib Khan

Publisher:

Published: 2022

Total Pages: 0

ISBN-13:

DOWNLOAD EBOOK →

We are currently in the era of big data, where data of enormous volume and variety is generated continuously, and they need to be captured, processed, and analyzed at a high velocity.Many users have increasingly adopted big data analytics to analyze and extract value (e.g., supporting business decisions and uncovering new insights) from massive amounts of data. Modern cluster computing frameworks (e.g., Spark) have facilitated this widespread adoption of analytics by providing developer-friendly APIs and excellent performance on diverse systems. While modern data analytics frameworks have achieved incredible advancements in terms of speed and performance, non-trivial challenges have emerged due to various factors, such as the increasing size of datasets, increasing complexity of the configuration-performance relationship, and a shift from on-premise infrastructure to the cloud. These challenges need to be tackled for the continued growth of the big data revolution. The growing trend of larger dataset sizes translates to a higher requirement for system resources (e.g., compute, memory, storage).This increase in demand is exacerbated by the fact that modern data analytics frameworks rely heavily on memory for providing significant performance gains over the previous generation of disk-based frameworks. Currently, the decrease in DRAM price is outpaced by the growth in dataset sizes, and Non-Volatile Memory (NVM) is a promising solution for meeting the increasing demand for memory. However, a complete replacement of DRAM by NVM is not viable due to NVM having several drawbacks, namely higher access latency, lower bandwidth, and endurance. Thus hybrid memory architectures that provide higher capacity at a lower cost by combining DRAM and NVM to overcome the disadvantages of NVM have been proposed to tackle the issue of increased memory requirement. Unfortunately, current memory management mechanisms within modern data analytics frameworks are not suitable for hybrid memory and require to be redesigned and optimized for taking advantage of such systems. Even if data analytics systems have enough resources, achieving optimal utilization to extract the best workload performance is immensely challenging. For proper utilization of cluster resources, analytics workloads need to run on optimal configurations. As modern data analytics frameworks mature, more configuration parameters are introduced for adapting to new use cases and systems. While new parameters allow the frameworks to be more flexible and versatile, this increases the dimensionality of the configuration space. Each new dimension in the configuration space exponentially increases the number of possible configurations, which makes determining the optimal configuration significantly difficult. Furthermore, the relationship between workload configuration and performance is complex. Existing tuning solutions require numerous workload execution samples to train a performance model for handling the complex configuration-performance relationship. However, running analytics workloads with large datasets is costly, rendering current solutions unsuitable in most real-life scenarios. A viable automated tuner needs to recommend optimal or near-optimal configurations within a limited number of iterations and keep costs low. Another phenomenon that adds to the challenge of optimal resource allocation and utilization is the shift towards the cloud for running analytics workloads. Cloud service providers offer hundreds of Virtual Machine (VM) types that differ in compute, memory, network, and storage capabilities. Choosing the optimal number and type of VMs from the numerous possible combinations for workload deployment is a significant challenge. While contemporary solutions have advanced the field of cloud configuration tuning, they have limitations in the form of predetermined search spaces and underutilization of domain-based heuristics. This dissertation tackles the mentioned issues through three studies that propose architectural modifications and novel automated tuning frameworks for efficient data analytics.Firstly, we investigate the integration of Non-Volatile Memory (NVM) through hybridization in the memory management mechanisms of Spark. We propose several modifications to the software stack to effectively support the hybridization of the Spark cache in an optimized manner. Our evaluation results have demonstrated that the proposed hybridization strategy keeps the increase in execution time minimal while only requiring a fraction of DRAM compared to a fully DRAM system. Secondly, we offer a high-dimensional cluster configuration tuner called ROBOTune that finds optimal or near-optimal configurations for efficient data analytics. ROBOTune employs a Random Forests model to handle the high-dimensionality of the analytics configuration space and couples it with a Bayesian Optimization engine to search for optimal configurations within a limited budget. Evaluation of an extensive set of applications shows that ROBOTune finds configurations that perform better on average while significantly improving search cost and search speed compared to contemporary solutions. Thirdly, we propose a cloud resource allocation tuner BoundConfig, which utilizes framework-level execution metrics for dynamically determining a workload-specific cloud VM search space. We also employ domain-driven heuristics for identifying well-performing initial configurations to bootstrap the tuning process. Furthermore, BoundConfig couples these techniques with a Bayesian Optimizer equipped with a noise-resilient acquisition function and metric-based output constraints that guide the search. Workload-specific search spaces reduce the tuning cost, while well-performing initial configurations speed up the process. Our extensive experiments for BoundConfig on AWS EC2 have demonstrated its significant advantage in search speed and cost compared to contemporary solutions.

Performance and Capacity Implications for Big Data

Performance and Capacity Implications for Big Data PDF

Author: Dave Jewell

Publisher: IBM Redbooks

Published: 2014-02-07

Total Pages: 48

ISBN-13: 0738453587

DOWNLOAD EBOOK →

Big data solutions enable us to change how we do business by exploiting previously unused sources of information in ways that were not possible just a few years ago. In IBM® Smarter Planet® terms, big data helps us to change the way that the world works. The purpose of this IBM RedpaperTM publication is to consider the performance and capacity implications of big data solutions, which must be taken into account for them to be viable. This paper describes the benefits that big data approaches can provide. We then cover performance and capacity considerations for creating big data solutions. We conclude with what this means for big data solutions, both now and in the future. Intended readers for this paper include decision-makers, consultants, and IT architects.

IBM Software Defined Infrastructure for Big Data Analytics Workloads

IBM Software Defined Infrastructure for Big Data Analytics Workloads PDF

Author: Dino Quintero

Publisher: IBM Redbooks

Published: 2015-06-29

Total Pages: 180

ISBN-13: 0738440779

DOWNLOAD EBOOK →

This IBM® Redbooks® publication documents how IBM Platform Computing, with its IBM Platform Symphony® MapReduce framework, IBM Spectrum Scale (based Upon IBM GPFSTM), IBM Platform LSF®, the Advanced Service Controller for Platform Symphony are work together as an infrastructure to manage not just Hadoop-related offerings, but many popular industry offeringsm such as Apach Spark, Storm, MongoDB, Cassandra, and so on. It describes the different ways to run Hadoop in a big data environment, and demonstrates how IBM Platform Computing solutions, such as Platform Symphony and Platform LSF with its MapReduce Accelerator, can help performance and agility to run Hadoop on distributed workload managers offered by IBM. This information is for technical professionals (consultants, technical support staff, IT architects, and IT specialists) who are responsible for delivering cost-effective cloud services and big data solutions on IBM Power SystemsTM to help uncover insights among client's data so they can optimize product development and business results.

IBM System Storage DS8000 Performance Monitoring and Tuning

IBM System Storage DS8000 Performance Monitoring and Tuning PDF

Author: Axel Westphal

Publisher: IBM Redbooks

Published: 2016-04-07

Total Pages: 600

ISBN-13: 073844149X

DOWNLOAD EBOOK →

This IBM® Redbooks® publication provides guidance about how to configure, monitor, and manage your IBM DS8880 storage systems to achieve optimum performance, and it also covers the IBM DS8870 storage system. It describes the DS8880 performance features and characteristics, including hardware-related performance features, synergy items for certain operating systems, and other functions, such as IBM Easy Tier® and the DS8000® I/O Priority Manager. The book also describes specific performance considerations that apply to particular host environments, including database applications. This book also outlines the various tools that are available for monitoring and measuring I/O performance for different server environments, and it describes how to monitor the performance of the entire DS8000 storage system. This book is intended for individuals who want to maximize the performance of their DS8880 and DS8870 storage systems and investigate the planning and monitoring tools that are available. The IBM DS8880 storage system features, as described in this book, are available for the DS8880 model family with R8.0 release bundles (Licensed Machine Code (LMC) level 7.8.0).