Upgrade an Installation from the Tarball
Users with an enterprise account can upgrade to a full, common, or core tarball installation. Other users can upgrade to the common tarball installation and install additional stage libraries as needed.
To upgrade a full, common, or core installation from the tarball, perform the following steps:
Step 1. Shut Down the Previous Version
Step 2. Back Up the Previous Version
Step 3. Install the New Version
Step 4. Update Environment Variables
Step 5. Update the Configuration Files
Step 6. Install Additional Libraries for the Core Installation
Step 1. Shut Down the Previous Version
Stop all running pipelines and then shut down the previous version of Data Collector.
-
Use one of the following methods to stop all running pipelines:
-
If the Data Collector is not registered to work with StreamSets Control Hub, stop the pipelines using the Data Collector UI.
From the Data Collector Home page, select all running pipelines in the list and then click the Stop icon.
-
If the Data Collector is registered to work with StreamSets Control Hub, stop all jobs running on the Data Collector using the Control Hub UI.
From the Control Hub Jobs page, filter the jobs by engine and by engine label. Select all active jobs in the list and then click the Stop Jobs icon.
-
-
Use one of the following methods to shut down Data Collector:
- To use the command line for shutdown when Data Collector is started as a service, use the required command for your operating
system.
For CentOS 6, Oracle Linux 6, Red Hat Enterprise Linux 6, or Ubuntu 14.04 LTS, use:
service sdc stop
For CentOS 7, Oracle Linux 7, Red Hat Enterprise Linux 7, or Ubuntu 16.04 LTS, use:
systemctl stop sdc
-
To use the Data Collector UI, click . When the confirmation dialog box appears, click Yes.
- To use the command line for shutdown when Data Collector is started as a service, use the required command for your operating
system.
Step 2. Back Up the Previous Version
Before you install the new version, create a backup of the files in the previous version by copying and renaming the configuration, data, and resource directories. That way, you can continue to run the previous version if needed.
- SDC_CONF - The Data Collector configuration directory.
- SDC_DATA - The Data Collector directory for pipeline state and configuration information.
- SDC_EXTERNAL_RESOURCES - The Data Collector directory for external resources.
- SDC_RESOURCES - The Data Collector directory for runtime resource files.
-
STREAMSETS_LIBRARIES_EXTRA_DIR - The Data Collector directory for external libraries.
-
USER_LIBRARIES_DIR - The Data Collector directory for custom stages.
/etc/sdc3000
.For more information about these directories, see Data Collector Directories.
Step 3. Install the New Version
The instructions that you use to install the new version depend on whether you start Data Collector manually or as a service and on your operating system.
Users with an enterprise account can upgrade to a full, common, or core tarball installation. Other users can upgrade to the common tarball installation and install additional stage libraries as needed.
Installing from the Tarball (Manual Start)
Install the new version of the tarball.
Installing from the Tarball for Systems Using SysV (Service Start)
Operating systems that use the SysV init system include CentOS 6, Oracle Linux 6, Red Hat Enterprise Linux 6, or Ubuntu 14.04 LTS.
Install the new version of the tarball. Installing the full Data Collector as a service requires root privileges.
Installing from the Tarball for Systems Using Systemd (Service Start)
Operating systems that use the systemd init system include CentOS 7, Oracle Linux 7, Red Hat Enterprise Linux 7, or Ubuntu 16.04 LTS.
Install the new version of the tarball. Installing the full Data Collector as a service requires root privileges.
Step 4. Update Environment Variables
Update the Data Collector environment variables so that the new version of Data Collector uses a new configuration directory but the same working data, log, and resource directories as the previous version.
For example, your previous Data Collector
version used the directory /var/lib/sdc
to store the data files for
pipeline configuration and run details. When you upgrade, you configure the new
version of Data Collector
to use the same working directory /var/lib/sdc
for the data files.
As a result, the new version has access to the pipelines created in the previous
version.
Update the environment variables in the required file based on your installation type. For more information about the required file to edit, see Modifying Environment Variables.
Step 5. Update the Configuration Files
A new Data Collector
version can include new properties and configuration files required for Data Collector
to start or function properly. In the previous step, you updated the environment
configuration file so that the new version of Data Collector
uses the new configuration files stored in the $SDC_CONF
directory.
In this step, you’ll compare the previous and new versions of the configuration
files, and update the new files as needed with the same customized property
values.
For example when upgrading from version 3.0.0.0, you'd compare the files in your back
up directory, /etc/sdc3000
, with the files in the /etc/sdc
directory. Then update the new files in the /etc/sdc
directory with any customizations made in the previous files in the
/etc/sdc3000
directory.
Step 6. Install Additional Libraries for the Core Installation
If you upgraded a common or core installation of Data Collector, install the individual stage libraries that the upgraded pipelines require.
Step 7. Start the New Version of Data Collector
Start the new version of Data Collector.
- To start Data Collector manually
- Use the following command from the
$SDC_DIST
directory to run Data Collector as the system user account logged into the command prompt:bin/streamsets dc
- To start Data Collector as a service
- Use the required command for your operating system:
- For CentOS 6, Oracle Linux 6, Red Hat Enterprise Linux 6, or Ubuntu
14.04 LTS, use:
service sdc start
- For CentOS 7, Oracle Linux 7, Red Hat Enterprise Linux 7, or Ubuntu
16.04 LTS, use:
systemctl start sdc
- For CentOS 6, Oracle Linux 6, Red Hat Enterprise Linux 6, or Ubuntu
14.04 LTS, use: