Narrow your search

Library

ULB (22)

KU Leuven (19)

ULiège (17)

Odisee (13)

Thomas More Kempen (13)

Thomas More Mechelen (13)

UCLL (13)

VIVES (13)

UGent (8)

UCLouvain (5)

More...

Resource type

book (21)

periodical (1)


Language

English (21)

French (1)


Year
From To Submit

2022 (1)

2021 (1)

2020 (1)

2018 (1)

2015 (1)

More...
Listing 1 - 10 of 22 << page
of 3
>>
Sort by

Book
Big data : nouvelles partitions de l'information : actes du colloque de l'INRIA, octobre 2014
Authors: --- --- ---
ISSN: 22953825 ISBN: 9782804189150 2804189155 Year: 2015 Volume: *4 Publisher: Bruxelles : De Boeck,

Loading...
Export citation

Choose an application

Bookmark

Abstract

Le Big Data est omniprésent dans les médias. Qualifié de source d?innovation, de richesses, de création d?emplois, d?enjeu démocratique quand il est ± open , le Big Data fascine et effraye à la fois. Mais de quoi parle-t-on exactement ? Ces données massives sont-elles du seul domaine des informaticiens, des statisticiens, des politiques et des créateurs d?entreprises ? Les professionnels de l?information-documentation n?ont-ils pas un rôle à jouer dans ce nouveau paysage : identification, qualification, archivage, classification ? Cet ouvrage rassemble les contributions de spécialistes issus de diverses disciplines et réunis au colloque Inria en octobre 2014. Dans le flou lié à la mutation profonde que connaît actuellement le paysage informationnel, ils donnent les clés pour appréhender ce nouveau domaine et pour percevoir la place réservée aux compétences métier de l?information-documentation.


Periodical
Distributed and parallel databases.
ISSN: 15737578 09268782 Year: 1993 Publisher: [Dordrecht] : [New York] : Kluwer Academic Publishers Springer US


Book
Beginning Apache Spark 3 : with DataFrame, Spark SQL, structured streaming, and Spark machine learning library
Author:
ISBN: 1484273834 1484273826 Year: 2021 Publisher: New York, New York : Apress,

Loading...
Export citation

Choose an application

Bookmark

Abstract

Take a journey toward discovering, learning, and using Apache Spark 3.0. In this book, you will gain expertise on the powerful and efficient distributed data processing engine inside of Apache Spark; its user-friendly, comprehensive, and flexible programming model for processing data in batch and streaming; and the scalable machine learning algorithms and practical utilities to build machine learning applications. Beginning Apache Spark 3 begins by explaining different ways of interacting with Apache Spark, such as Spark Concepts and Architecture, and Spark Unified Stack. Next, it offers an overview of Spark SQL before moving on to its advanced features. It covers tips and techniques for dealing with performance issues, followed by an overview of the structured streaming processing engine. It concludes with a demonstration of how to develop machine learning applications using Spark MLlib and how to manage the machine learning development lifecycle. This book is packed with practical examples and code snippets to help you master concepts and features immediately after they are covered in each section. After reading this book, you will have the knowledge required to build your own big data pipelines, applications, and machine learning applications. What You Will Learn Master the Spark unified data analytics engine and its various components Work in tandem to provide a scalable, fault tolerant and performant data processing engine Leverage the user-friendly and flexible programming model to perform simple to complex data analytics using dataframe and Spark SQL Develop machine learning applications using Spark MLlib Manage the machine learning development lifecycle using MLflow Who This Book Is For Data scientists, data engineers and software developers.


Book
Principles of Distributed Database Systems
Authors: ---
ISBN: 1441988343 Year: 2011 Publisher: New York, NY : Springer New York : Imprint: Springer,

Loading...
Export citation

Choose an application

Bookmark

Abstract

This third edition of a classic textbook can be used to teach at the senior undergraduate and graduate levels. The material concentrates on fundamental theories as well as techniques and algorithms. The advent of the Internet and the World Wide Web, and, more recently, the emergence of cloud computing and streaming data applications, has forced a renewal of interest in distributed and parallel data management, while, at the same time, requiring a rethinking of some of the traditional techniques. This book covers the breadth and depth of this re-emerging field. The coverage consists of two parts. The first part discusses the fundamental principles of distributed data management and includes distribution design, data integration, distributed query processing and optimization, distributed transaction management, and replication. The second part focuses on more advanced topics and includes discussion of parallel database systems, distributed object management, peer-to-peer data management, web data management, data stream systems, and cloud computing. New in this Edition: • New chapters, covering database replication, database integration, multidatabase query processing, peer-to-peer data management, and web data management. • Coverage of emerging topics such as data streams and cloud computing • Extensive revisions and updates based on years of class testing and feedback Ancillary teaching materials are available.


Book
Expert Oracle RAC Performance Diagnostics and Tuning
Author:
ISBN: 1430267100 1430267097 Year: 2014 Publisher: Berkeley, CA : Apress : Imprint: Apress,

Loading...
Export citation

Choose an application

Bookmark

Abstract

Expert Oracle RAC Performance Diagnostics and Tuning provides comprehensive coverage of the features, technology and principles for testing and tuning RAC databases. The book takes a deep look at optimizing RAC databases by following a methodical approach based on scientific analysis rather than using a speculative approach, twisting and turning knobs and gambling on the system. The book starts with the basic concepts of tuning methodology, capacity planning, and architecture. Author Murali Vallath then dissects the various tiers of the testing implementation, including the operating system, the network, the application, the storage, the instance, the database, and the grid infrastructure. He also introduces tools for performance optimization and thoroughly covers each aspect of the tuning process, using many real-world examples, analyses, and solutions from the field that provide you with a solid, practical, and replicable approach to tuning a RAC environment. The book concludes with troubleshooting guidance and quick reference of all the scripts used in the book. Expert Oracle RAC Performance Diagnostics and Tuning covers scenarios and details never discussed before in any other performance tuning books. If you have a RAC database, this book is a requirement. Get your copy today. Takes you through optimizing the various tiers of the RAC environment. Provides real life case studies, analysis and solutions from the field. Maps a methodical approach to testing, tuning and diagnosing the cluster.


Book
Beginning Apache Cassandra Development
Author:
ISBN: 1484201426 1484201434 Year: 2014 Publisher: Berkeley, CA : Apress : Imprint: Apress,

Loading...
Export citation

Choose an application

Bookmark

Abstract

Beginning Apache Cassandra Development introduces you to one of the most robust and best-performing NoSQL database platforms on the planet. Apache Cassandra is a document database following the JSON document model. It is specifically designed to manage large amounts of data across many commodity servers without there being any single point of failure. This design approach makes Apache Cassandra a robust and easy-to-implement platform when high availability is needed. Apache Cassandra can be used by developers in Java, PHP, Python, and JavaScript—the primary and most commonly used languages. In Beginning Apache Cassandra Development, author and Cassandra expert Vivek Mishra takes you through using Apache Cassandra from each of these primary languages. Mishra also covers the Cassandra Query Language (CQL), the Apache Cassandra analog to SQL. You'll learn to develop applications sourcing data from Cassandra, query that data, and deliver it at speed to your application's users. Cassandra is one of the leading NoSQL databases, meaning you get unparalleled throughput and performance without the sort of processing overhead that comes with traditional proprietary databases. Beginning Apache Cassandra Development will therefore help you create applications that generate search results quickly, stand up to high levels of demand, scale as your user base grows, ensure operational simplicity, and—not least—provide delightful user experiences.

Listing 1 - 10 of 22 << page
of 3
>>
Sort by