Neo4j est un système de gestion de bases de données NoSQL orientées graphe, particulièrement adapté à des situations où les données sont fortement connectées et structurées selon des modèles complexes et évolutifs. À la différence des systèmes classiques, son approche n'est pas fondée sur l'algèbre relationnelle mais sur la théorie des graphes. Il permet ainsi de trouver de manière très performante des données reliées entre elles selon une profondeur non connue à l’avance. Ce livre vous permettra de prendre connaissance avec l'approche graphe de Neo4j et d'en mesurer le potentiel. Il vous fournira tous les outils pour pouvoir le tester aussi bien en local que sur un serveur distant. Vous apprendrez à l'aide d'exemples concrets à manipuler les données avec son langage d'interrogation CYPHER et via son API REST. Il s'adresse à tous les développeurs, responsables de projets, architectes logiciels en quête d'une alternative aux systèmes de bases de données relationnelles classiques. Il ne requiert aucune compétence particulière en matière de théorie des graphes. En revanche, une connaissance des bases de données traditionnelles aidera à mieux cerner les problématiques auxquelles répond Neo4j. Cette deuxième édition a été enrichie d'un nouveau chapitre portant sur les graphgists et l'analyse d'un cas d'utilisation. Son code a été entièrement révisé pour être compatible avec la version 3.0. et différents points ont été complétés pour intégrer les multiples changements apparus depuis la version 2.0 de Neo4j. Sommaire : Découvrir Neo4j Comprendre Neo4j Évaluer Neo4j : CYPHER Évaluer Neo4j : API REST Apprendre et partager avec les graphgists Annexes Aide-mémoire CYPHER Glossaire
Vous avez déjà une idée de ce qu'est Neo4j et de son mode de fonctionnement, peut-être même avez-vous déjà fait quelques tests. La question maintenant est de savoir comment le mettre en œuvre dans un contexte de production. C'est ce que vous explique ce livre. Conçu sous la forme originale d'un dialogue entre les différents membres d'une équipe technique spécialisée dans la manipulation de données fortement connectées, il s'efforce de répondre à toutes les questions qu'on peut se poser lors de la mise en place d'un projet, de la preuve de concept jusqu'à la maintenance et la sécurisation. Des annexes, ainsi que de multiples encadrés au fil du récit viennent compléter certains aspects techniques. Vous y apprendrez : ・ à analyser un problème en termes de graphe ・ à intégrer Neo4j dans un système déjà existant et interagir avec d'autres bases de données ・ à importer ou exporter des données ・ à garantir la disponibilité et la sécurité des données ・ à surveiller et maintenir Neo4j Si vous débutez avec Neo4j, nous vous recommandons de commencer par le volume I. Prise en main. "C'est sous forme théâtrale que j'ai choisi de relater le déploiement de Neo4j, car n'est-ce pas finalement ce à quoi ressemble notre vie professionnelle ? Un petit théâtre." - Sylvain Roussy "Le futur de Neo4j est plein d'innovation et de possibilités : Mervaillie, Roussy et Rouyer ont écrit un ouvrage abouti qui vous aidera à en tirer tout le potentiel." - Jim Webber, Chief Scientist de Neo4j, Inc.
Author: Rik Van Bruggen
Publisher: Packt Publishing Ltd
This book is for developers who want an alternative way to store and process data within their applications. No previous graph database experience is required; however, some basic database knowledge will help you understand the concepts more easily.
Author: Ryan Stephens, Ronald Plew
Publisher: Sams Publishing
Database Design is the book you need to master the fundamentals of relational database design in today's ever-evolving world of database technologies. This book takes an approach to database design to teach the reader how to reach into the inner depths of an organization to understand the business needs, data, and daily processes that will all blend together to formulate a successful database. Much emphasis is placed on logical design as it is imperative to understand the inner workings of an organization to produce the highest quality database, while proactively eliminating future problems that are not so easy for a beginner to foresee. Additionally, topics such as change control, business rule integration, database security implementation, and legacy database redesign are covered in detail. Examples of SQL code are shown to portray implementation tasks of a relational database. Examples are compliant with ANSI standard.
Data science libraries, frameworks, modules, and toolkits are great for doing data science, but they’re also a good way to dive into the discipline without actually understanding data science. In this book, you’ll learn how many of the most fundamental data science tools and algorithms work by implementing them from scratch. If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with hacking skills you need to get started as a data scientist. Today’s messy glut of data holds answers to questions no one’s even thought to ask. This book provides you with the know-how to dig those answers out. Get a crash course in Python Learn the basics of linear algebra, statistics, and probability—and understand how and when they're used in data science Collect, explore, clean, munge, and manipulate data Dive into the fundamentals of machine learning Implement models such as k-nearest Neighbors, Naive Bayes, linear and logistic regression, decision trees, neural networks, and clustering Explore recommender systems, natural language processing, network analysis, MapReduce, and databases
Author: Shashank Tiwari
Publisher: John Wiley & Sons
A hands-on guide to leveraging NoSQL databases NoSQL databases are an efficient and powerful tool for storing and manipulating vast quantities of data. Most NoSQL databases scale well as data grows. In addition, they are often malleable and flexible enough to accommodate semi-structured and sparse data sets. This comprehensive hands-on guide presents fundamental concepts and practical solutions for getting you ready to use NoSQL databases. Expert author Shashank Tiwari begins with a helpful introduction on the subject of NoSQL, explains its characteristics and typical uses, and looks at where it fits in the application stack. Unique insights help you choose which NoSQL solutions are best for solving your specific data storage needs. Professional NoSQL: Demystifies the concepts that relate to NoSQL databases, including column-family oriented stores, key/value databases, and document databases. Delves into installing and configuring a number of NoSQL products and the Hadoop family of products. Explains ways of storing, accessing, and querying data in NoSQL databases through examples that use MongoDB, HBase, Cassandra, Redis, CouchDB, Google App Engine Datastore and more. Looks at architecture and internals. Provides guidelines for optimal usage, performance tuning, and scalable configurations. Presents a number of tools and utilities relating to NoSQL, distributed platforms, and scalable processing, including Hive, Pig, RRDtool, Nagios, and more.
Addresses the Challenges Facing Public Transport Policy Makers and Operators Public Transit Planning and Operation: Modeling, Practice and Behavior, Second Edition offers new solutions for delivering both better services and greater efficiency, solutions which have been developed and tested by the author in over thirty years of research work with mass transit policy makers and operators all over the world. It bridges the worlds of practice and research and academia, provides an overview and a critique of currently used operational planning methods, and furnishes innovative practical techniques and modeling. Improve Service Performance and Successfully Manage the Costs of Operation This new edition brings in new material on timetabling and vehicle scheduling with different vehicle sizes, new methods of designing transit route networks, analysis of transit coordination and connectivity, behavioral aspects of passengers including when making transfers, and innovative methods related to automation and optimization which can be used in real time to significantly improve service reliability. Combines academic research with real-world project experience Focuses on issues encountered in practice Provides unique coverage of the field Public Transit Planning and Operation: Modeling, Practice and Behavior, Second Edition incorporates a series of themes and new ways of thinking about planning and operation. Bridging the gap between theory and application, this text outlines the factors affecting public-transport services, addresses common problems, and offers practical solutions for improvement.
Run blazingly fast queries on complex graph datasets with the power of the Neo4j graph databaseAbout This Book* Get acquainted with graph database systems and apply them in real-world use cases* Use Cypher query language, APOC and other Neo4j extensions to derive meaningful analysis from complex data sets.* A practical guide filled with ready to use examples on querying, graph processing and visualizing information to build smarter spatial applications.Who This Book Is ForThis book is for developers who want an alternative way to store and process data within their applications. No previous graph database experience is required; however, some basic database knowledge will help you understand the concepts more easily.What You Will Learn* Understand the science of graph theory, databases and its advantages over traditional databases.* Install Neo4j, model data and learn the most common practices of traversing data* Learn the Cypher query language and tailor-made procedures to analyze and derive meaningful representations of data* Improve graph techniques with the help of precise procedures in the APOC library* Use Neo4j advanced extensions and plugins for performance optimization.* Understand how Neo4j's new security features and clustering architecture are used for large scale deployments.In DetailNeo4j is a graph database that allows traversing huge amounts of data with ease. This book aims at quickly getting you started with the popular graph database Neo4j.Starting with a brief introduction to graph theory, this book will show you the advantages of using graph databases along with data modeling techniques for graph databases. You'll gain practical hands-on experience with commonly used and lesser known features for updating graph store with Neo4j's Cypher query language. Furthermore, you'll also learn to create awesome procedures using APOC and extend Neo4j's functionality, enabling integration, algorithmic analysis, and other advanced spatial operation capabilities on data.Through the course of the book you will come across implementation examples on the latest updates in Neo4j, such as in-graph indexes, scaling, performance improvements, visualization, data refactoring techniques, security enhancements, and much more. By the end of the book, you'll have gained the skills to design and implement modern spatial applications, from graphing data to unraveling business capabilities with the help of real-world use cases.Style and approachA step-by-step approach of adopting Neo4j, the world's leading graph database. This book includes a lot of background information, helps you grasp the fundamental concepts behind this radical new way of dealing with connected data, and will give you lots of examples of use cases and environments where a graph database would be a great fit
Author: Pramod J. Sadalage, Martin Fowler
Publisher: Pearson Education
The need to handle increasingly larger data volumes is one factor driving the adoption of a new class of nonrelational “NoSQL” databases. Advocates of NoSQL databases claim they can be used to build systems that are more performant, scale better, and are easier to program. NoSQL Distilled is a concise but thorough introduction to this rapidly emerging technology. Pramod J. Sadalage and Martin Fowler explain how NoSQL databases work and the ways that they may be a superior alternative to a traditional RDBMS. The authors provide a fast-paced guide to the concepts you need to know in order to evaluate whether NoSQL databases are right for your needs and, if so, which technologies you should explore further. The first part of the book concentrates on core concepts, including schemaless data models, aggregates, new distribution models, the CAP theorem, and map-reduce. In the second part, the authors explore architectural and design issues associated with implementing NoSQL. They also present realistic use cases that demonstrate NoSQL databases at work and feature representative examples using Riak, MongoDB, Cassandra, and Neo4j. In addition, by drawing on Pramod Sadalage's pioneering work, NoSQL Distilled shows how to implement evolutionary design with schema migration: an essential technique for applying NoSQL databases. The book concludes by describing how NoSQL is ushering in a new age of Polyglot Persistence, where multiple data-storage worlds coexist, and architects can choose the technology best optimized for each type of data access.
Machine learning analyzes big data to uncover patterns invisible to humans. These technologies help Internet users find things online, make it possible to quickly translate speech, and create smarter video game opponents. Big data and machine learning are used everywhere in society, and the opportunities for their uses are endless.
Advanced Analytics with Spark
Author: Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills
Publisher: "O'Reilly Media, Inc."
In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming. You’ll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques—including classification, clustering, collaborative filtering, and anomaly detection—to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you’ll find the book’s patterns useful for working on your own data applications. With this book, you will: Familiarize yourself with the Spark programming model Become comfortable within the Spark ecosystem Learn general approaches in data science Examine complete implementations that analyze large public data sets Discover which machine learning tools make sense for particular problems Acquire code that can be adapted to many uses
Author: Lars George
Publisher: "O'Reilly Media, Inc."
If your organization is looking for a storage solution to accommodate a virtually endless amount of data, this book will show you how Apache HBase can fulfill your needs. As the open source implementation of Google's BigTable architecture, HBase scales to billions of rows and millions of columns, while ensuring that write and read performance remain constant.HBase: The Definitive Guideprovides the details you require, whether you simply want to evaluate this high-performance, non-relational database, or put it into practice right away. HBase's adoption rate is beginning to climb, and several IT executives are asking pointed questions about this high-capacity database. This is the only book available to give you meaningful answers. Learn how to distribute large datasets across an inexpensive cluster of commodity servers Develop HBase clients in many programming languages, including Java, Python, and Ruby Get details on HBase's primary storage system, HDFS—Hadoop’s distributed and replicated filesystem Learn how HBase's native interface to Hadoop’s MapReduce framework enables easy development and execution of batch jobs that can scan entire tables Discover the integration between HBase and other facets of the Apache Hadoop project
Neo4j in Action
Author: Jonas Partner, Aleksa Vukotic, Nicki Watt, Tareq Abedrabbo, Dominic Fox
Publisher: Manning Publications
Summary Neo4j in Action is a comprehensive guide to Neo4j, aimed at application developers and software architects. Using hands-on examples, you'll learn to model graph domains naturally with Neo4j graph structures. The book explores the full power of native Java APIs for graph data manipulation and querying. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Technology Much of the data today is highly connected—from social networks to supply chains to software dependency management—and more connections are continually being uncovered. Neo4j is an ideal graph database tool for highly connected data. It is mature, production-ready, and unique in enabling developers to simply and efficiently model and query connected data. About the Book Neo4j in Action is a comprehensive guide to designing, implementing, and querying graph data using Neo4j. Using hands-on examples, you'll learn to model graph domains naturally with Neo4j graph structures. The book explores the full power of native Java APIs for graph data manipulation and querying. It also covers Cypher, Neo4j's graph query language. Along the way, you'll learn how to integrate Neo4j into your domain-driven app using Spring Data Neo4j, as well as how to use Neo4j in standalone server or embedded modes. Knowledge of Java basics is required. No prior experience with graph data or Neo4j is assumed. What's Inside Graph database patterns How to model data in social networks How to use Neo4j in your Java applications How to configure and set up Neo4j About the Authors Aleksa Vukotic is an architect specializing in graph data models. Nicki Watt, Dominic Fox, Tareq Abedrabbo, and Jonas Partner work at OpenCredo, a Neo Technology partner, and have been involved in many projects using Neo4j. Table of Contents PART 1 INTRODUCTION TO NEO4J A case for a Neo4j database Data modeling in Neo4j Starting development with Neo4j The power of traversals Indexing the data PART 2 APPLICATION DEVELOPMENT WITH NEO4J Cypher: Neo4j query language Transactions Traversals in depth Spring Data Neo4j PART 3 NEO4J IN PRODUCTION Neo4j: embedded versus server mode