Skip to content

kunickiaj/datacollector-docker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Dockerfiles for StreamSets Data Collector

Basic Usage

docker run -P -d streamsets/datacollector

Detailed Usage

  • You can specify a custom configs by mounting them as a volume to /etc/sdc or /etc/sdc/
  • You should specify a data volume for your pipelines and pipeline state. The default configured location is /data. You can override this location by passing a different value to the environment variable SDC_DATA
  • You can also specify your own explicit port mappings, or arguments to the streamsets command.

For example to run with a customized sdc.properties file, a local filsystem path to store pipelines, and statically map the default UI port you could use the following:

docker run -v $PWD/sdc.properties:/etc/sdc/sdc.properties:ro -v $PWD/sdc-data:/data:rw -p 18630:18630 -d streamsets/datacollector dc

Creating a Data Container

To create a dedicated data-only container for the pipeline store issue the following command:

docker create -v /data --name sdc-data streamsets/datacollector:latest /bin/true

You can then use the --volumes-from argument to mount it when you start the data collector.

docker run --volumes-from sdc-data -P -d streamsets/datacollector dc

About

Dockerfiles for StreamSets Data Collector

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages