CG Enterprise is a powerful and intuitive solution for web data extraction that has unparalleled support for large-scale web data extraction operations. It has been specifically designed for corporations with a critical reliance on structured web data, legal compliance and those who demand data quality and reliability.
CG Enterprise succeeds where most competitor solutions fail. Its advanced features ensure that you can extract content from complex websites while also being intuitive and user-friendly. It includes sophisticated features for monitoring data extraction success criteria, legal compliance and production fail-over that aren’t available in other solutions.
CG Enterprise includes the full suite of components to run large-scale web data extraction operations within your own cloud or data center environment.
For development & maintenance of web data extraction agents.
For running agents in production environments. The Server license also includes the Agent Control Center, which provides a centralized platform for large-scale web data extraction operations.
Your license needs depend on the size of your web data extraction operation. If you are starting out small and just want a single machine license for developing agents, you will initially need CG Enterprise for Desktop.
As your operation expands, or if you need separate development and production environments, then you can take advantage of the centralized operational controls you get from CG Enterprise for Server and the Agent Control Center.
Enterprise for Desktop is essential for the development of web data extraction agents. It provides both a development platform for producing web data extraction agents and a production run-time. This license alone can be used for single server operations. One user license is required per developer, server or cloud. Enterprise for Desktop is the only license that can be used for web data extraction agent creation. If you are just beginning or are only after a single user license, you would only need CG Enterprise for Desktop.
Enterprise for Server is an optimized production run-time license which can also be used for basic maintenance of existing agents. Organizations who want separate development and production environments should use CG Enterprise for Server in conjunction with CG Enterprise for Desktop. When you purchase a CG Enterprise for Server license, you also get the Agent Control Center. A single Enterprise for Server license is required per server machine or cloud instance.
The visual point and click editor is easy to use even for non-technical users. Automatically detects and configures all commands types. Browser-like view of website data. Often no coding is required, custom code can be added at any point in the workflow.
Powerful testing and debugging features help you build reliable agents.Solid error handling and error recovery will keep the agents running in the most difficult scenarios.
Easily scale with multiple sessions running in parallel and work distributed across multiple servers/clouds.
Embed the CG Enterprise runtime into your own software
Call the CG Enterprise Rest API from anywhere
Export directly into third-party Data Analytics / Visualization tools
Easily shift your operation from an outsourced services model to in-house without needing to start again.
Scripting can be used for more precise control if you have unusual requirements or for process tuning.
For organizations that rely on web data as an input to their own data products, CG Enterprise helps ensure strict compliance to website data usage terms. Agent configurations are stored in version control with changes tracked, supporting an audit ready operation and clear control over key concerns like rate or type of requests being made, making it easy to comply with pre-defined operating guidelines. An agent can even be configured to halt all data collection if requests are not in compliance with the target website’s robots.txt file.
You can run CG Enterprise on your own infrastructure to develop agents and extract content from as many websites as you like. There are no restrictions on the number of agents, page loads or websites to extract from and there are no monthly data fees. You can also control your own data security.
Export data in numerous formats including Excel, CSV, JSON, XML, PDF, MYSQL, SQL Server, Oracle, Apache Parquet, MongoDB, Cosmos and most other databases via OleDB. Ability to deliver data to many local and cloud object stores (i.e. Amazon AWS S3, Azure, Google Drive/Cloud, Dropbox, SFTP, Email). Data de-duplication & the ability to write directly to custom data structures.