Apache Spark

Apache Spark is an open-source framework for distributed processing of unstructured and semi-structured data, and part of the Apache Hadoop ecosystem of projects.

Packt Publishing, 2017. — 356 p. — ISBN 978-1785885136. True PDF Key Features Exclusive guide that covers how to get up and running with fast data processing using Apache Spark Explore and exploit various possibilities with Apache Spark using real-world use cases in this book Want to perform efficient data processing at real time? This book will be your one-stop solution. Book...
  • No. 1
  • 10.72 MB
Packt Publishing, 2017. — 356 p. — ISBN 978-1785885136. The Spark juggernaut keeps on rolling and gaining more momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML, and GraphX, all accessible via Java, Scala, Python, and R. Deploying the key capabilities is crucial whether it is on a Standalone framework or as a part of existing...
  • No. 2
  • 13.57 MB
Packt Publishing, 2017. — 356 p. — ISBN 978-1785885136. The Spark juggernaut keeps on rolling and gaining more momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML, and GraphX, all accessible via Java, Scala, Python, and R. Deploying the key capabilities is crucial whether it is on a Standalone framework or as a part of existing...
  • No. 3
  • 13.50 MB
Packt Publishing, 2017. — 666 p. — ASIN B01BKL1PD8. Simplify machine learning model implementations with Spark About This Book Solve the day-to-day problems of data science with Spark This unique cookbook consists of exciting and intuitive numerical recipes Optimize your work by acquiring, cleaning, analyzing, predicting, and visualizing your data Who This Book Is For This book is...
  • No. 4
  • 8.84 MB
Packt Publishing, 2017. — 666 p. — ASIN B01BKL1PD8. Simplify machine learning model implementations with Spark About This Book Solve the day-to-day problems of data science with Spark This unique cookbook consists of exciting and intuitive numerical recipes Optimize your work by acquiring, cleaning, analyzing, predicting, and visualizing your data Who This Book Is For This book is...
  • No. 5
  • 9.17 MB
Packt Publishing, 2017. — 666 p. — ASIN B01BKL1PD8. Simplify machine learning model implementations with Spark About This Book Solve the day-to-day problems of data science with Spark This unique cookbook consists of exciting and intuitive numerical recipes Optimize your work by acquiring, cleaning, analyzing, predicting, and visualizing your data Who This Book Is For This book is...
  • No. 6
  • 22.66 MB
Packt Publishing Ltd., Birmingham, UK, 2016. — 325 p. — ISBN: 9781785884696. A handy reference guide for data analysts and data scientists to help to obtain value from big data analytics using Spark on Hadoop clusters Big Data Analytics book aims at providing the fundamentals of Apache Spark and Hadoop. All Spark components – Spark Core, Spark SQL, DataFrames, Data sets,...
  • No. 7
  • 10.21 MB
Sams Publishing, 2017. — 592 p. — ISBN-13 978-0-672-33851-9. Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage...
  • No. 8
  • 36.56 MB
Sams Publishing, 2017. — 592 p. — ISBN-13 978-0-672-33851-9. Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage...
  • No. 9
  • 27.62 MB
Sams Publishing, 2017. — 592 p. — ISBN-13 978-0-672-33851-9. Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage...
  • No. 10
  • 27.41 MB
Packt Publishing, 2016. — 339 p. — ISBN 1785885650. — ASIN: B01CGKAILW. Key Features Perform data analysis and build predictive models on huge datasets that leverage Apache Spark Learn to integrate data science algorithms and techniques with the fast and scalable computing features of Spark to address big data challenges Work through practical examples on real-world problems...
  • No. 11
  • 13.00 MB
Manning Publications, 2016. — 472 p. Big data systems distribute datasets across clusters of machines, making it a challenge to efficiently query, stream, and interpret them. Spark can help. It is a processing system designed specifically for distributed data. It provides easy-to-use interfaces, along with the performance you need for production-quality analytics and machine...
  • No. 12
  • 10.96 MB
Packt Publishing, 2016. — 392 p. — ISBN 1785880101. True PDF Spark has emerged as the most promising big data analytics engine for data science professionals. The true power and value of Apache Spark lies in its ability to execute data science tasks with speed and accuracy. Spark’s selling point is that it combines ETL, batch analytics, real-time stream analysis, machine learning,...
  • No. 13
  • 4.51 MB
O'Reilly Media, 2020. — 107 p. — ISBN 978-1-4920-5004-9. 2nd edition. Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.x, this second edition shows data...
  • No. 14
  • 2.64 MB
O'Reilly Media, 2020. — 107 p. — ISBN 978-1-4920-5004-9. 2nd edition. Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.x, this second edition shows data...
  • No. 15
  • 894.95 KB
O'Reilly Media, 2020. — 107 p. — ISBN 978-1-4920-5004-9. 2nd edition. Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.x, this second edition shows data...
  • No. 16
  • 4.46 MB
O'Reilly Media, 2020. — 107 p. — ISBN 978-1-4920-5004-9. 2nd edition. Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.x, this second edition shows data...
  • No. 17
  • 1.58 MB
O'Reilly Media, 2020. — 107 p. — ISBN 978-1-4920-5004-9. 2nd edition. Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.x, this second edition shows data...
  • No. 18
  • 2.45 MB
A guide from Databricks on using Apache Spark. Contents: Introduction; Log Analysis with Spark; Introduction to Apache Spark; Importing Data; Exporting Data; Log Analyzer Application; Twitter Streaming Language Classifier; Collect a Dataset of Tweets; Examine the Tweets and Train a Model; Apply the Model in Real-time.
  • No. 19
  • 556.28 KB
Apress, 2016. — 296 p. — ISBN: 9781484221747. This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology—Scala/Spark, Mesos, Akka, Cassandra, and Kafka—in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting,...
  • No. 20
  • 4.60 MB
Apress, 2016. — 296 p. — ISBN: 9781484221747. This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology—Scala/Spark, Mesos, Akka, Cassandra, and Kafka—in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting,...
  • No. 21
  • 2.35 MB
Apress, 2018. — 375 p. — ISBN 978-1-4842-2148-8. See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the , the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he...
  • No. 22
  • 9.49 MB
Apress, 2018. — 375 p. — ISBN 978-1-4842-2148-8. See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the , the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he...
  • No. 23
  • 4.90 MB
Apress, 2018. — 375 p. — ISBN 978-1-4842-2148-8; e-ISBN 978-1-4842-2149-5. See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the , the author begins by creating a private cloud and then installs and examines Apache...
  • No. 24
  • 3.66 MB
Packt Publishing, 2015. — 318 p. Apache Spark is an in-memory, cluster-based, parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations. This book aims to take your limited knowledge of Spark to the...
  • No. 25
  • 17.76 MB
Packt Publishing, 2018. — 142 p. — ASIN B07HRTNFZ9. No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark...
  • No. 26
  • 1.35 MB
Packt Publishing, 2018. — 142 p. — ASIN B07HRTNFZ9. No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark...
  • No. 27
  • 2.38 MB
Packt Publishing, 2018. — 142 p. — ASIN B07HRTNFZ9. !Code files only No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs...
  • No. 28
  • 1.40 MB
Packt Publishing, 2018. — 142 p. — ASIN B07HRTNFZ9. No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark...
  • No. 29
  • 1.30 MB
Indianapolis, IN : Wiley, 2016. — 205 p. — ISBN 978-1-119-25404-1. Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you through the challenges in moving from proof-of-concept...
  • No. 30
  • 5.91 MB
Ganelin Ilya, Orhian Ema, Sasaki Kai, York Brennon. — Indianapolis, IN : Wiley, 2016. — 205 p. — ISBN 978-1-119-25404-1. Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you...
  • No. 31
  • 2.38 MB
Packt, 2019. — 322 p. — ISBN 1788994613. Speed up the design and implementation of deep learning solutions using Apache Spark Deep learning is a subset of machine learning where datasets with several layers of complexity can be processed. Hands-On Deep Learning with Apache Spark addresses the sheer complexity of technical and analytical parts and the speed at which deep learning...
  • No. 32
  • 18.02 MB
Packt, 2019. — 322 p. — ISBN 1788994613. Speed up the design and implementation of deep learning solutions using Apache Spark Deep learning is a subset of machine learning where datasets with several layers of complexity can be processed. Hands-On Deep Learning with Apache Spark addresses the sheer complexity of technical and analytical parts and the speed at which deep learning...
  • No. 33
  • 12.94 MB
Packt, 2019. — 322 p. — ISBN 1788994613. Speed up the design and implementation of deep learning solutions using Apache Spark Deep learning is a subset of machine learning where datasets with several layers of complexity can be processed. Hands-On Deep Learning with Apache Spark addresses the sheer complexity of technical and analytical parts and the speed at which deep learning...
  • No. 34
  • 2.81 MB
O'Reilly Media, 2017. — 352 p. — ISBN 978-1491960110. Data science teams looking to turn research into useful analytics applications require not only the right tools, but also the right approach if they’re to succeed. With the revised second edition of this hands-on guide, up-and-coming data scientists will learn how to use the Agile Data Science development methodology to build...
  • No. 35
  • 4.49 MB
O'Reilly Media, 2017. — 352 p. — ISBN 978-1491960110. Data science teams looking to turn research into useful analytics applications require not only the right tools, but also the right approach if they’re to succeed. With the revised second edition of this hands-on guide, up-and-coming data scientists will learn how to use the Agile Data Science development methodology to build...
  • No. 36
  • 4.31 MB
O'Reilly Media, 2017. — 352 p. — ISBN 978-1491960110. Data science teams looking to turn research into useful analytics applications require not only the right tools, but also the right approach if they’re to succeed. With the revised second edition of this hands-on guide, up-and-coming data scientists will learn how to use the Agile Data Science development methodology to build...
  • No. 37
  • 11.53 MB
O'Reilly Media, 2015. — 274 p. — e-ISBN: 978-1-4493-5904-1, ISBN 10: 1-4493-5904-3. Data in all domains is getting bigger. How can you work with it efficiently? This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java,...
  • No. 38
  • 7.82 MB
O'Reilly Media, 2017. — 358 p. — ISBN 978-1491943205. Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster...
  • No. 39
  • 2.15 MB
O'Reilly Media, 2017. — 358 p. — ISBN 978-1491943205. True PDF Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run...
  • No. 40
  • 7.00 MB
O'Reilly Media, 2017. — 358 p. — ISBN 978-1-491-94320-5. Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster...
  • No. 41
  • 2.32 MB
O'Reilly Media, 2017. — 358 p. — ISBN 978-1-491-94320-5. Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster...
  • No. 42
  • 2.15 MB
O'Reilly Media, 2017. — 358 p. — ISBN 978-1-491-94320-5. Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster...
  • No. 43
  • 5.68 MB
Packt Publishing, 2017. — 797 p. — ISBN: 978-1785280849. Key Features Learn Scala's sophisticated type system that combines Functional Programming and object-oriented concepts Work on a wide array of applications from simple batch jobs to stream processing and machine learning Explore the most common as well as some complex use-cases to perform large-scale data analysis with Spark...
  • No. 44
  • 17.88 MB
Packt Publishing, 2017. — 797 p. — ISBN 978-1785280849. Key Features Learn Scala's sophisticated type system that combines Functional Programming and object-oriented concepts Work on a wide array of applications from simple batch jobs to stream processing and machine learning Explore the most common as well as some complex use-cases to perform large-scale data analysis with Spark...
  • No. 45
  • 18.21 MB
Packt Publishing, 2017. — 874 p. — ISBN 10 1785280848, 13 978-1785280849. Scala has been observing wide adoption over the past few years, especially in the field of data science and analytics. Spark, built on Scala, has gained a lot of recognition and is being used widely in productions. Thus, if you want to leverage the power of Scala and Spark to make sense of big data, this...
  • No. 46
  • 86.48 MB
Packt Publishing, 2016. — 476 p. — ISBN 978-1-78588-874-8. Discover everything you need to build robust machine learning applications with Spark 2.0 Data processing, implementing related algorithms, tuning, scaling up and finally deploying are some crucial steps in the process of optimising any application. Spark is capable of handling large-scale batch and streaming data to...
  • No. 47
  • 10.75 MB
Packt Publishing, 2016. — 476 p. — ISBN 978-1-78588-874-8. Discover everything you need to build robust machine learning applications with Spark 2.0 Data processing, implementing related algorithms, tuning, scaling up and finally deploying are some crucial steps in the process of optimising any application. Spark is capable of handling large-scale batch and streaming data to...
  • No. 48
  • 7.13 MB
2nd ed. — Packt Publishing, 2017. — 354 p. — ASIN B01MR4YF5G. Advanced analytics on your Big Data with latest Apache Spark 2.x About This Book An advanced guide with a combination of instructions and practical examples to extend the most up-to date Spark functionalities. Extend your data processing capabilities to process huge chunk of data in minimum time using advanced concepts...
  • No. 49
  • 10.70 MB
2nd ed. — Packt Publishing, 2017. — 354 p. — ASIN B01MR4YF5G. Advanced analytics on your Big Data with latest Apache Spark 2.x About This Book An advanced guide with a combination of instructions and practical examples to extend the most up-to date Spark functionalities. Extend your data processing capabilities to process huge chunk of data in minimum time using advanced concepts...
  • No. 50
  • 10.75 MB
2nd Edition. — Packt Publishing, 2017. — 345 p. Apache Spark is an in-memory, cluster-based, parallel processing system that provides a wide range of functionality such as graph processing, machine learning, stream processing, and SQL. This book aims to take your limited knowledge of Spark to the next level by teaching you how to expand your Spark functionality. The book opens...
  • No. 51
  • 29.65 MB
Packt Publishing, 2017. — 350 p. — ASIN B01LY3N7ZO. Key Features Perform big data processing with Spark—without having to learn Scala! Use the Spark Java API to implement efficient enterprise-grade applications for data processing and analytics Go beyond mainstream data processing by adding querying capability, Machine Learning, and graph processing using Spark Book Description...
  • No. 52
  • 3.34 MB
Packt Publishing, 2017. — 350 p. — ASIN B01LY3N7ZO. Key Features Perform big data processing with Spark—without having to learn Scala! Use the Spark Java API to implement efficient enterprise-grade applications for data processing and analytics Go beyond mainstream data processing by adding querying capability, Machine Learning, and graph processing using Spark Book...
  • No. 53
  • 3.45 MB
Packt Publishing, 2017. — 350 p. — ASIN B01LY3N7ZO. Key Features Perform big data processing with Spark—without having to learn Scala! Use the Spark Java API to implement efficient enterprise-grade applications for data processing and analytics Go beyond mainstream data processing by adding querying capability, Machine Learning, and graph processing using Spark Book Description...
  • No. 54
  • 7.95 MB
Packt Publishing, 2016. — 252 p. — ASIN: B01GEUF1H6. True PDF Key Features Customize Apache Spark and R to fit your analytical needs in customer research, fraud detection, risk analytics, and recommendation engine development Develop a set of practical Machine Learning applications that can be implemented in real-life projects A comprehensive, project-based guide to improve and...
  • No. 55
  • 4.22 MB
Apress, 2018. — 393 p. — ISBN 978-1484235782. Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it. Along the way, you’ll discover...
  • No. 56
  • 5.55 MB
Apress, 2018. — 393 p. — ISBN 978-1484235782. Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it. Along the way, you’ll discover...
  • No. 57
  • 7.20 MB
O’Reilly Media, 2019. — 156 p. — ISBN 1491944242. To build analytics tools that provide faster insights, knowing how to process data in real time is a must, and moving from batch processing to stream processing is absolutely required. Fortunately, the Spark in-memory framework/platform for processing data has added an extension devoted to fault-tolerant stream processing: Spark...
  • No. 58
  • 3.89 MB
O’Reilly Media, 2019. — 156 p. — ISBN 1491944242. To build analytics tools that provide faster insights, knowing how to process data in real time is a must, and moving from batch processing to stream processing is absolutely required. Fortunately, the Spark in-memory framework/platform for processing data has added an extension devoted to fault-tolerant stream processing: Spark...
  • No. 59
  • 1.78 MB
O'Reilly Media, 2019. — 400 p. — ISBN 10 1491944242, 13 978-1491944240. Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to write...
  • No. 60
  • 4.43 MB
Manning Publications, 2016. — 282 p. in color. — ISBN: 1617292524, 9781617292521. Spark GraphX in Action starts out with an overview of Apache Spark and the GraphX graph processing API. This example-based tutorial then teaches you how to configure GraphX and how to use it interactively. Along the way, you'll collect practical techniques for enhancing applications and applying...
  • No. 61
  • 17.16 MB
SIGMOD '15 Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data Spark SQL is a new module in Apache Spark that integrates relational processing with Spark’s functional programming API. Built on our experience with Shark, Spark SQL lets Spark programmers leverage the benefits of relational processing (e.g., declarative queries and optimized storage),...
  • No. 62
  • 536.69 KB
Springer, 2018. — 274 p. — ISBN 9811305498. The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements...
  • No. 63
  • 4.38 MB
Springer, 2018. — 274 p. — ISBN 9811305498. The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements...
  • No. 64
  • 8.49 MB
Packt Publishing, 2017. — 550 p. Master the techniques and sophisticated analytics used to construct Spark-based solutions that scale to deliver production-grade data science products. Data science seeks to transform the world using data, and this is typically achieved through disrupting and changing real processes in real industries. In order to operate at this level you need to...
  • No. 65
  • 34.99 MB
Packt Publishing, 2017. — 560 p. — ISBN 978-1-78588-214-2. Master the techniques and sophisticated analytics used to construct Spark-based solutions that scale to deliver production-grade data science products Data science seeks to transform the world using data, and this is typically achieved through disrupting and changing real processes in real industries. In order to operate...
  • No. 66
  • 4.20 MB
Packt Publishing, 2017. — 560 p. — ISBN 978-1-78588-214-2. Master the techniques and sophisticated analytics used to construct Spark-based solutions that scale to deliver production-grade data science products Data science seeks to transform the world using data, and this is typically achieved through disrupting and changing real processes in real industries. In order to operate...
  • No. 67
  • 9.20 MB
Packt Publishing, 2015. — 206 p. Looking for a cluster computing system that provides high-level APIs? Apache Spark is your answer—an open source, fast, and general purpose cluster computing system. Spark's multi-stage memory primitives provide performance up to 100 times faster than Hadoop, and it is also well-suited for machine learning algorithms. Are you a Python developer...
  • No. 68
  • 9.43 MB
NY: InfoQ, 2018. — 104 p. Apache Spark is an open-source big-data processing framework built around speed, ease of use, and sophisticated analytics. Spark has several advantages compared to other big-data and MapReduce technologies like Hadoop and Storm. It provides a comprehensive, unified framework with which to manage big-data processing requirements for datasets that are...
  • No. 69
  • 2.45 MB
Packt Publishing, 2015. — 338 p. — e-ISBN: 978-1-78328-852-6, ISBN 10: 1-78328-852-3. Apache Spark is a framework for distributed computing that is designed from the ground up to be optimized for low latency tasks and in-memory data storage. It is one of the few frameworks for parallel computing that combines speed, scalability, in-memory processing, and fault tolerance with ease...
  • No. 70
  • 4.74 MB
Packt Publishing, 2017. — 452 p. — ISBN 978-1-78588-835-9. Design, implement, and deliver successful streaming applications, machine learning pipelines and graph applications using Spark SQL API In the past year, Apache Spark has been increasingly adopted for the development of distributed applications. Spark SQL APIs provide an optimized interface that helps developers build such...
  • No. 71
  • 16.85 MB
Packt Publishing, 2017. — 452 p. — ISBN 978-1-78588-835-9. Design, implement, and deliver successful streaming applications, machine learning pipelines and graph applications using Spark SQL API In the past year, Apache Spark has been increasingly adopted for the development of distributed applications. Spark SQL APIs provide an optimized interface that helps developers build such...
  • No. 72
  • 17.16 MB
Packt Publishing, 2017. — 452 p. — ISBN 978-1-78588-835-9. Design, implement, and deliver successful streaming applications, machine learning pipelines and graph applications using Spark SQL API In the past year, Apache Spark has been increasingly adopted for the development of distributed applications. Spark SQL APIs provide an optimized interface that helps developers build such...
  • No. 73
  • 40.58 MB
Packt Publishing, 2018. — 474 p. — ISBN 978-1788474221. A solution-based guide to put your deep learning models into production with the power of Apache Spark Key Features Discover practical recipes for distributed deep learning with Apache Spark Learn to use libraries such as Keras and TensorFlow Solve problems in order to train your deep learning models on Apache Spark Book...
  • №74
  • 33.99 MB
Packt Publishing, 2018. — 474 p. — ISBN 978-1788474221. A solution-based guide to put your deep learning models into production with the power of Apache Spark. Key Features: Discover practical recipes for distributed deep learning with Apache Spark; learn to use libraries such as Keras and TensorFlow; solve problems in order to train your deep learning models on Apache Spark. Book...
  • №75
  • 47.05 MB
Packt Publishing, 2017. — 350 p. — ISBN 978-1-78712-649-7. Unleash the data processing and analytics capability of Apache Spark with the language of choice: Java. Apache Spark is the buzzword in the big data industry right now, especially with the increasing need for real-time streaming and data processing. While Spark is built on Scala, the Spark Java API exposes all the Spark...
  • №76
  • 3.34 MB
Packt Publishing, 2017. — 350 p. — ISBN 978-1-78712-649-7. Unleash the data processing and analytics capability of Apache Spark with the language of choice: Java. Apache Spark is the buzzword in the big data industry right now, especially with the increasing need for real-time streaming and data processing. While Spark is built on Scala, the Spark Java API exposes all the Spark...
  • №77
  • 3.37 MB
Packt Publishing, 2017. — 350 p. — ISBN 978-1-78712-649-7. Unleash the data processing and analytics capability of Apache Spark with the language of choice: Java. Apache Spark is the buzzword in the big data industry right now, especially with the increasing need for real-time streaming and data processing. While Spark is built on Scala, the Spark Java API exposes all the Spark...
  • №78
  • 8.02 MB
Apress, 2019. — 288 p. — ISBN: 1484236513, 9781484236512. Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark with the help of practical code snippets for each topic. Practical...
  • №79
  • 23.31 MB
Syncfusion Inc., 2015. — 111 p. Mastering big data requires an aptitude at every step of information processing. Post-processing, one of the most important steps, is where you find Apache Spark frequently employed. Spark Succinctly, by Marko Švaljek, addresses Spark’s use in the ultimate step in handling big data. Topics included: - Introduction - Installing Spark - Hello...
  • №80
  • 3.40 MB
Packt Publishing, 2017. — 323 p. — ISBN 978-1-78528-345-1. Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial. The purpose of machine learning is to build systems that learn from data. Being able to understand trends and patterns in complex data is critical to success; it is one of the key strategies...
  • №81
  • 12.78 MB
Packt Publishing, 2017. — 323 p. — ISBN 978-1-78528-345-1. Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial. The purpose of machine learning is to build systems that learn from data. Being able to understand trends and patterns in complex data is critical to success; it is one of the key strategies...
  • №82
  • 7.54 MB
Packt Publishing, 2017. — 323 p. — ISBN 978-1-78528-345-1. Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial. The purpose of machine learning is to build systems that learn from data. Being able to understand trends and patterns in complex data is critical to success; it is one of the key strategies...
  • №83
  • 7.41 MB
Packt Publishing, 2017. — 323 p. — ISBN 978-1-78528-345-1. Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial. The purpose of machine learning is to build systems that learn from data. Being able to understand trends and patterns in complex data is critical to success; it is one of the key strategies...
  • №84
  • 36.79 MB
True PDF Packt Publishing, 2016. — 322 p. — ISBN 978-1-78588-500-6. Spark is one of the most widely-used large-scale data processing engines and runs extremely fast. It is a framework that has tools that are equally useful for application developers as well as data scientists. This book starts with the fundamentals of Spark 2 and covers the core data processing framework and API,...
  • №85
  • 21.81 MB
Packt Publishing, 2016. — 332 p. Spark is one of the most widely-used large-scale data processing engines and runs extremely fast. It is a framework that has tools that are equally useful for application developers as well as data scientists. This book starts with the fundamentals of Spark 2 and covers the core data processing framework and API, installation, and application...
  • №86
  • 8.60 MB
Packt Publishing, 2016. — 326 p. — ISBN: 9781785884696. The Big Data Analytics book aims to provide the fundamentals of Apache Spark and Hadoop. All Spark components (Spark Core, Spark SQL, DataFrames, Datasets, Conventional Streaming, Structured Streaming, MLlib, GraphX) and Hadoop core components (HDFS, MapReduce, and YARN) are explored in greater depth with implementation...
  • №87
  • 6.53 MB
Packt Publishing, 2017. — 294 p. — ISBN-13 9781787127265. True PDF. Over 70 recipes to help you use Apache Spark as your single big data computing platform and master its libraries. While Apache Spark 1.x gained a lot of traction and adoption in the early years, Spark 2.x delivers notable improvements in the areas of API, schema awareness, performance, Structured Streaming, and...
  • №88
  • 14.55 MB
Packt Publishing, 2017. — 294 p. — ISBN-13 9781787127265. Over 70 recipes to help you use Apache Spark as your single big data computing platform and master its libraries. While Apache Spark 1.x gained a lot of traction and adoption in the early years, Spark 2.x delivers notable improvements in the areas of API, schema awareness, performance, Structured Streaming, and simplifying...
  • №89
  • 6.08 MB
O'Reilly Media, 2018. — 608 p. — ISBN 978-1491912218. Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of this open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique...
  • №90
  • 7.52 MB
O'Reilly Media, 2018. — 608 p. — ISBN 978-1491912218. Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of this open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique...
  • №91
  • 8.60 MB
O'Reilly Media, 2018. — 608 p. — ISBN 978-1491912218. Only Code files! Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of this open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each...
  • №92
  • 181.05 MB
Apress, 2016. — 231 p. — ISBN 978-1-4842-4800. Learn the right cutting-edge skills and knowledge to leverage Spark Streaming to implement a wide array of real-time, streaming applications. Pro Spark Streaming walks you through end-to-end real-time application development using real-world applications, data, and code. Taking an application-first approach, each chapter introduces...
  • №93
  • 13.41 MB
Study guide. — St. Petersburg: ITMO University, 2019. — 50 p. This study guide contains theoretical material and worked examples of tasks for the course "Introduction to Big Data Processing Technologies". The guide is written with laboratory work on the Apache Spark framework in mind. The course content covers a range of topics related to the organization of...
  • №94
  • 2.81 MB
Study guide. — St. Petersburg: ITMO University, 2019. — 50 p. This study guide contains theoretical material and worked examples of tasks for the course "Introduction to Big Data Processing Technologies". The guide is written with laboratory work on the Apache Spark framework in mind. The course content covers a range of topics related to the organization of...
  • №95
  • 1.60 MB
DMK Press, 2015. — 303 p. — ISBN: 5970603236, 9785970603239. The volume of data processed in every area of human activity continues to grow rapidly. Are there effective techniques for working with it? This book describes Apache Spark, an open cluster computing system that makes it possible to quickly build high-performance data analysis programs...
  • №96
  • 15.69 MB
Moscow: DMK Press, 2015. — 304 p. The volume of data processed in every area of human activity continues to grow rapidly. Are there effective techniques for working with it? This book describes Apache Spark, an open cluster computing system that makes it possible to quickly build high-performance data analysis programs. With Spark you will be able to...
  • №97
  • 15.68 MB
St. Petersburg: Piter, 2017. — 272 p. In this practical book, four Cloudera data analysis specialists describe self-contained patterns for performing large-scale data analysis with Spark. The authors take a comprehensive look at Spark, statistical methods, and data sets collected under real-world conditions, and use these examples to demonstrate solutions to common...
  • №98
  • 3.05 MB
St. Petersburg: Piter, 2017. — 272 p. In this practical book, four Cloudera data analysis specialists describe self-contained patterns for performing large-scale data analysis with Spark. The authors take a comprehensive look at Spark, statistical methods, and data sets collected under real-world conditions, and use these examples to demonstrate solutions to common...
  • №99
  • 5.62 MB