"0 Preparation Background Multiple or diverse-source mixed computations are common needs. They ma .."

lisongbo RaqForum 82 No.
1 Reply • 2 View • 1 Days ago

SPL Lightweight Multisource Mixed Computation Practices

0 Preparation

Background

Multiple or diverse-source mixed computations are common needs. They may occur between different types of databases, between files and databases, and between NoSQL databases and files. Theoretically, such computations and analyses might exist between any data storages. The now known technologies cannot handle mixed computations well, even satisfactorily. Though some database products support mixed computations between same-type databases, it is difficult to perform those computations between completely different types of data sources. Logical data warehouses can implement the multi/diverse-source mixed computations to some extent because most of them are SQL-based and thus can access RDB data sources through table mapping. Accesses to the other types of data sources become hard, and data virtualization, which is complex, is needed. But even so not all data sources can be accessed. Moreover, logical data warehouse architecture is so heavy that usually it is even more complicated than the application itself. They only suit large-scale computing scenarios.

esProc SPL supports a rich variety of data sources, which, once successfully connected, will be transformed to uniform data objects (table sequence or cursor), laying the natural foundation for mixed computations between any data sources, as long as they can be accessed by SPL. Being very lightweight, SPL can be embedded in an application and equip the latter with the multi/diverse-source mixed computation ability. Moreover, SPL even outstrips SQL with its succinct syntax. Using SPL to perform multisource mixed computations brings both computing ability and convenient engineering implementation.

SPL provides different connectors for different data sources. Below is the SPL-led computing architecture:

There are two types of SPL data source connectors – native and external. Native connectors are built as SPL’s core components. They include the most commonly seen RDBs, local files such as text, Excel and JSON, and the HTTP source. Various other data sources, such as MongoDB, Kafka, ElasticSearch and cloud storages, are external connectors, which are not SPL core components and need to be specifically deployed.

Below is a list of commonly used data sources SPL supports:

SPL native connectors support accessing JDBC data sources such as MySQL and Oracle, local files such as CSV, Excel and JSON, Web data sources such as HTTP and RestAPI, remote files, and other sources.

Our discussion about SPL multisource mixed computations will cover the following parts:

Prepare environment

esProc SPL

Download and install esProc SPL Standard Edition .

Databases

MySQL

Download and install MySQL.

MongoDB

Install and configure MySQL.

Download data files

data.rar
program.rar

SPL Official Website 👉 https://www.esproc.com

SPL Feedback and Help 👉 https://www.reddit.com/r/esProcSPL

SPL Learning Material 👉 https://c.esproc.com

SPL Source Code and Package 👉 https://github.com/SPLWare/esProc

Discord 👉 https://discord.gg/sxd59A8F2W

Youtube 👉 https://www.youtube.com/@esProc_SPL

esProc

lisongbo • 2 View • 1 Days ago

SPL Lightweight Multisource Mixed Computation Practices

0 Preparation

Background

Contents

Practices #1: Running SQL in RDBs

Practices #2: Querying CSV/XLS and other files

Practices #3: Querying Restful/JSON data

Practices #4: Querying MongoDB

Practices #5: Cross-datasource union and comparison

Practice #6: Cross-datasource JOIN

Practice #7: SQL migration