StreamSets

Visualizing and Analyzing Salesforce Data with Neo4j

May 16, 2017, 11:19 am

Graph databases represent and store data in terms of nodes, edges and properties, allowing quick, easy retrieval of complex hierarchical structures that may be difficult to model in traditional...

Embrace Diversity in Your Data Architecture

June 8, 2017, 9:17 pm

Many Roads Lead to Rome. Over the last ten years, the data management landscape has changed dramatically; on that, I think we can all agree. The rise of big data and the new data management ecosystem...

Announcing Data Collector ver 2.6.0.0

June 12, 2017, 3:50 pm

We are excited to announce version 2.6 of StreamSets Data Collector. This release has important functionality focused on helping customers to modernize their enterprise data warehouses on Hadoop,...

Introducing the Data Collector Support Bundle

June 13, 2017, 7:47 am

Hi, my name is Wagner Camarao and I'm a Software Engineer at StreamSets focusing on the user-facing aspects of our products. Today I'm going to talk about a new feature in the StreamSets Data Collector...

Triggering Databricks Notebook Jobs from StreamSets Data Collector

June 20, 2017, 6:45 pm

Last December, I covered Continuous Data Integration with StreamSets Data Collector and Spark Streaming on Databricks. In StreamSets Data Collector (SDC) version 2.5.0.0 we added the Spark Executor,...

Cache Salesforce Data in Redis with StreamSets Data Collector

July 6, 2017, 8:03 am

Redis is an open-source, in-memory, NoSQL database implementing a networked key-value store with optional persistence to disk. Perhaps the most popular key-value database, Redis is widely used for...

Scaling out StreamSets with Kubernetes

July 13, 2017, 11:01 pm

In today’s microservice revolution, where software applications are designed as independent services that work together, two technologies stand out. Docker, the de facto standard for containerization,...

Announcing Data Collector v2.7.0.0

August 21, 2017, 11:16 am

We have discovered a regression with the 2.7.0.0 release build and have decided to remove the build from distribution. The bug has been fixed and we are in the process of releasing an update as soon...

Fast, Easy Access to Secure Kafka Clusters

August 28, 2017, 10:01 am

It’s simple to connect StreamSets Data Collector (SDC) to Apache Kafka through the Kafka Consumer Origin and Kafka Producer Destination connectors. And because those connectors support all Kafka Client...

Announcing Data Collector v2.7.1.0

August 30, 2017, 11:23 am

We are happy to announce version 2.7.1.0 of StreamSets Data Collector. This release has a number of new features, improvements and bug fixes. For a list of all our new features, please see What's New....

Getting Started with StreamSets Data Collector on Docker

September 5, 2017, 6:00 am

‘Simplicity is the ultimate sophistication.’ – Leonardo da Vinci

As a recent hire on the Engineering Productivity team here at StreamSets, my early days at the company were marked by efforts to dive...

Ask StreamSets: Questions and Answers for the StreamSets Community

September 6, 2017, 10:50 am

It's fair to say that most developers are familiar with Stack Overflow and the Stack Exchange network of question and answer sites. Q&A sites such as Stack Overflow serve communities of users...

Straight from Our Customers: The Benefits of Modern Ingestion

October 20, 2017, 6:00 am

Three months into my journey here at StreamSets, I’ve had a chance to talk with many of our customers and prospects to understand how they are using the open source StreamSets Data Collector (SDC)...

Getting Started with Cloudera’s Cybersecurity Solution (feat. StreamSets,...

October 23, 2017, 8:50 am

This post was originally published on the Cloudera VISION blog by Sam Heywood. StreamSets configurations and images of Apache Spot Open Data Model ingest pipelines can be found here on GitHub. A...

Evolving Avro Schemas with Apache Kafka and StreamSets Data Collector

October 25, 2017, 11:51 am

Apache Avro is widely used in the Hadoop ecosystem for efficiently serializing data so that it may be exchanged between applications written in a variety of programming languages. Avro allows data to...

How to Convert Apache Sqoop™ Commands Into StreamSets Data Collector Pipelines

October 26, 2017, 4:06 pm

When it comes to loading data into Apache Hadoop™, the de facto choice for bulk loads of data from leading relational databases is Apache Sqoop™. After initially entering Apache Incubator status in...

Fun with FileRefs – Manipulating Whole File Data

November 2, 2017, 3:15 pm

As well as parsing incoming data into records, many StreamSets Data Collector (SDC) origins can be configured to ingest Whole Files. The blog entry Whole File Transfer with StreamSets Data Collector...

Bulk Loading Data into Snowflake Data Warehouse

November 17, 2017, 10:03 am

Mike Fuller, a consultant at Red Pill Analytics, has been busy integrating an Oracle RDS database with Snowflake's cloud data warehouse via StreamSets Data Collector. His blog post on bulk loading data...

Announcing StreamSets Data Collector version 3.0

November 27, 2017, 9:23 pm

Version 3.0 marks an important new milestone for StreamSets. With close to a million downloads and a strong community and customer base, we are very excited to offer a host of powerful new capabilities...

Announcing StreamSets Data Collector Edge

November 28, 2017, 1:39 am

Today an increasing amount of data is being generated from outside the data center or cloud – it isn’t always easy to get this data out of source systems or perform analytics right where it’s...
