WeDataSphere  by WeBankFinTech

Big data platform suite for financial applications

created 6 years ago
670 stars

Top 51.3% on sourcepulse

GitHubView on GitHub
Project Summary

WeDataSphere is a comprehensive, financial-grade big data platform suite designed for enterprise-level data application development and management. It offers a unified ecosystem of tools covering the entire data lifecycle, from import and cleaning to analysis, visualization, and machine learning, targeting data engineers, analysts, and ML practitioners.

How It Works

WeDataSphere is built on a layered architecture, featuring a computation middleware (Apache Linkis) that decouples applications from underlying engines like Spark and Flink, providing standardized interfaces and cross-engine context sharing. The platform integrates various specialized components, including DataSphere Studio for graphical development, Scriptis for interactive analysis, Qualitis for data quality, Schedulis for workflow scheduling, Exchangis for data exchange, Visualis for BI, Prophecis for ML, and Streamis for streaming applications. This modular approach aims to simplify complex big data operations and enhance reliability.

Quick Start & Requirements

  • Installation typically involves deploying multiple components. Specific commands depend on the chosen deployment method (e.g., Docker, Kubernetes).
  • Prerequisites include Java, Maven, and potentially Hadoop, Spark, Flink, and Kubernetes depending on the components used.
  • Official documentation and GitHub repositories for individual components are available for detailed setup.

Highlighted Details

  • "Financial grade" reliability with unified security control, containerization, and multi-tenant isolation.
  • Apache Linkis acts as a computation middleware, standardizing access to diverse data engines.
  • DataSphere Studio provides a unified UI for drag-and-drop development across the data application lifecycle.
  • Includes specialized tools for data quality, data exchange, BI visualization, and machine learning.

Maintenance & Community

  • The project is initiated by WeBank FinTech.
  • Community engagement is encouraged via GitHub issues and WeChat/QQ groups.

Licensing & Compatibility

  • Core components are open-sourced. Specific licenses for each component are not detailed in the README but are expected to be permissive for integration.

Limitations & Caveats

  • The README indicates that "more open-source WDS components? Coming soon...", suggesting the suite may not be fully open-sourced yet.
  • "Financial grade" claims require further validation regarding specific security certifications or compliance standards.
Health Check
Last commit

1 year ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
3 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.