CG ENTERPRISE

Best-in-class Technology for Reliable Web Data Extraction at Scale

CG Enterprise is a powerful and intuitive web data extraction solution with unparalleled support for large-scale operations. It has been specifically designed for corporations that rely critically on structured web data and that demand legal compliance, data quality and reliability.

CG Enterprise succeeds where most competitor solutions fail. Its advanced features let you extract content from complex websites, while the product itself remains intuitive and user-friendly. It includes sophisticated features for monitoring data extraction success criteria, legal compliance and production fail-over that aren’t available in other solutions.

CG Enterprise includes the full suite of components to run large-scale web data extraction operations within your own cloud or data center environment.

Licenses

CG Enterprise consists of two core license types:

  1. DESKTOP

     For development & maintenance of web data extraction agents.

  2. SERVER

     For running agents in production environments. The Server license also includes the Agent Control Center, which provides a centralized platform for large-scale web data extraction operations.

Your license needs depend on the size of your web data extraction operation. If you are starting out small and just want a single machine license for developing agents, you will initially need CG Enterprise for Desktop.

As your operation expands, or if you need separate development and production environments, you can take advantage of the centralized operational controls you get from CG Enterprise for Server and the Agent Control Center.

ENTERPRISE FOR DESKTOP

Enterprise for Desktop is essential for the development of web data extraction agents. It provides both a development platform for producing agents and a production run-time, so this license alone can be used for single-server operations. One user license is required per developer, server or cloud. Enterprise for Desktop is the only license that can be used to create web data extraction agents. If you are just beginning, or only want a single user license, you only need CG Enterprise for Desktop.

ENTERPRISE FOR SERVER

Enterprise for Server is an optimized production run-time license that can also be used for basic maintenance of existing agents. Organizations that want separate development and production environments should use CG Enterprise for Server in conjunction with CG Enterprise for Desktop. When you purchase a CG Enterprise for Server license, you also get the Agent Control Center. A single Enterprise for Server license is required per server machine or cloud instance.

AGENT CONTROL CENTER (ACC)

The Agent Control Center (ACC) is included with CG Enterprise for Server licenses. The ACC provides a fully managed end-to-end enterprise data extraction platform. It was developed to facilitate large-scale web data extraction operations with multiple users. The ACC provides centralized management of all agents, servers, security, software updates, schedules, deployments, user access, proxy pools, support tickets, and more, all within your own cloud or data center environment.

The ACC seamlessly integrates with the CG Enterprise for Desktop and CG Enterprise for Server licenses. The following diagram shows the combination of the CG Enterprise for Desktop, CG Enterprise for Server and the Agent Control Center, providing a fully managed end-to-end enterprise data extraction solution.

Key Features

USABILITY

The visual point-and-click editor is easy to use, even for non-technical users. It automatically detects and configures all command types and provides a browser-like view of website data. Often no coding is required, but custom code can be added at any point in the workflow.

RELIABILITY & SCALABILITY

Powerful testing and debugging features help you build reliable agents. Solid error handling and error recovery keep agents running in the most difficult scenarios.
Easily scale with multiple sessions running in parallel and work distributed across multiple servers/clouds.

INTEGRATION

Embed the CG Enterprise runtime into your own software.
Call the CG Enterprise REST API from anywhere.
Export directly into third-party data analytics and visualization tools.

FLEXIBILITY

Easily shift your operation from an outsourced services model to in-house without needing to start again.
Scripting can be used for more precise control if you have unusual requirements or for process tuning.

LEGAL COMPLIANCE

For organizations that rely on web data as an input to their own data products, CG Enterprise helps ensure strict compliance with website data usage terms. Agent configurations are stored in version control with changes tracked, which supports an audit-ready operation. You also get clear control over key concerns such as the rate or type of requests being made, making it easy to comply with pre-defined operating guidelines. An agent can even be configured to halt all data collection if requests are not in compliance with the target website’s robots.txt file.
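
As a rough illustration of the robots.txt check described above, the following Python sketch uses only the standard library. It is not CG Enterprise code or its API. The names `ComplianceError`, `load_rules`, `assert_compliant`, the sample rules and the `example-agent` user agent are all hypothetical, chosen only to show the general idea of halting before a disallowed request is made.

```python
from urllib.robotparser import RobotFileParser


class ComplianceError(Exception):
    """Hypothetical error raised when robots.txt disallows a planned request."""


def load_rules(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt content that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def assert_compliant(parser: RobotFileParser, user_agent: str, urls: list) -> list:
    """Return the URLs unchanged, or halt on the first disallowed one."""
    for url in urls:
        if not parser.can_fetch(user_agent, url):
            raise ComplianceError(f"robots.txt disallows {url} for {user_agent}")
    return list(urls)


# Hypothetical rules for an example site (assumption, not from the source).
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

rules = load_rules(SAMPLE_ROBOTS)
allowed = assert_compliant(rules, "example-agent", ["https://example.com/products"])
```

Checking the whole request plan up front, rather than one URL at a time during the run, means a non-compliant plan stops before any data is collected.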

NO LIMITS!

You can run CG Enterprise on your own infrastructure to develop agents and extract content from as many websites as you like. There are no restrictions on the number of agents, page loads or websites to extract from and there are no monthly data fees. You can also control your own data security.

DATA MANAGEMENT

Export data in numerous formats including Excel, CSV, JSON, XML, PDF, MySQL, SQL Server, Oracle, Apache Parquet, MongoDB, Cosmos and most other databases via OLE DB. Deliver data to many local and cloud object stores (e.g. Amazon AWS S3, Azure, Google Drive/Cloud, Dropbox, SFTP, Email). Data de-duplication and the ability to write directly to custom data structures are also included.
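
To make the de-duplication and export steps concrete, here is a minimal Python sketch using only the standard library. It does not reflect how CG Enterprise implements these features. The helper names (`dedupe`, `to_csv`, `to_json`), the `sku` and `price` fields and the sample rows are assumptions made for illustration.

```python
import csv
import io
import json


def dedupe(records, key_fields):
    """Keep the first record seen for each combination of key field values."""
    seen = set()
    unique = []
    for record in records:
        key = tuple(record.get(field) for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


def to_csv(records, fieldnames):
    """Serialize records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_json(records):
    """Serialize records to a JSON array."""
    return json.dumps(records, indent=2)


# Hypothetical extracted rows; the third repeats the first.
rows = [
    {"sku": "A1", "price": "9.99"},
    {"sku": "B2", "price": "4.50"},
    {"sku": "A1", "price": "9.99"},
]
unique_rows = dedupe(rows, key_fields=["sku"])
csv_text = to_csv(unique_rows, fieldnames=["sku", "price"])
json_text = to_json(unique_rows)
```

De-duplicating on an explicit key, rather than on the whole record, lets the same item be recognized even when non-key fields differ between extraction runs.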
