Releases: apache/texera
Apache Texera (Incubating) Release v1.1.0
Release Highlights
1. Feature Upgrades
- Advanced Computing Unit Feature: We refined the Kubernetes-based computing unit layer by improving the UI/UX and enabling shared-memory control, GPU support and JVM memory tuning (#3337, #3363, #3371, #3372, #3400, #3451, #3591, #3604).
- Dynamic Admin Configuration System: Site settings are now stored in the database, and admins can customize logos and favicons with a live preview of branding changes (#3453, #3480, #3700, #3743).
2. Operators
- New Visualization Operators: We added Choropleth Map, Bullet Chart, Dendrogram, Violin Plot, Volcano Plot, Nested Table, Range Slider, and Time Series Plot operators (#3352, #3356, #3362, #3374, #3397, #3418, #3531, #3744).
- Machine Learning Operators: We introduced a suite of scikit-learn training operators (#3535).
- Utility Operators: We added a Sleep operator, enhanced the Generator with examples, and improved the Projection operator with column re-ordering (#3537, #3543, #3661).
- Enhanced Visualization System: We reorganized operators under a new category hierarchy, with improved grouping and a first-level category display (#3364, #3369).
3. Authentication & Security
- Admin Account Management: Admin accounts can be auto-registered from configuration, and admins can leave comments on user accounts (#3266, #3387, #3625).
- Security Enhancements: The system tracks user activity with last-login times, separates JWT expiration from activity windows, and sends email alerts for unauthorized access attempts (#3391, #3393, #3541, #3693, #3694, #3703, #3731).
- Access Control Improvements: We refined JWT secret access logic, optimized token refresh behavior, and added login confirmation requirements (#3357, #3703).
4. Infrastructure & Deployment
- Apache Foundation Compliance: We added ASF headers, NOTICE and DISCLAIMER files, integrated Apache Rat checks in CI, and updated license headers (#3388, #3414, #3415, #3631, #3632, #3637).
- Kubernetes and Helm Enhancements: We added GCP deployment support, a DaemonSet for image pre-pulling, ingress class configuration, and improved job ordering (#3358, #3366, #3376, #3386, #3419, #3652).
- Build System Improvements: We automated jOOQ code generation in SBT, removed jOOQ from CI, and migrated builds to a JDK to support Java UDFs (#3628, #3670, #3689, #3746, #3747).
- Configuration Management: We centralized configuration in a dedicated service, added environment-variable defaults, and stored site settings in the database (#3458, #3466, #3622).
- Centralized Config Service: We introduced a centralized config-service with database-backed dynamic settings, enabling admins to modify branding, upload limits, authentication parameters, and system configurations through a web dashboard without service restarts or redeployment (#3453, #3458, #3466, #3480, #3511, #3522, #3533, #3622, #3664, #3693, #3700, #3743).
- Dependency Updates: We upgraded PyArrow from 19.0.1 to 21.0.0, updated jOOQ to 3.16.23, and updated Formly to 6.3.12 (#3368, #3565, #3600, #3687).
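
As a rough illustration of the environment-variable defaults mentioned under Configuration Management, here is a minimal sketch of the general pattern: a setting is read from the environment when present and falls back to a built-in default otherwise. The helper `read_setting` and the variable names are hypothetical and are not taken from the Texera codebase.

```python
import os


def read_setting(name, default, env=os.environ):
    """Return the environment value for `name`, or `default` when it is unset."""
    return env.get(name, default)


# Hypothetical variable names and values, for illustration only.
fake_env = {"EXAMPLE_SERVICE_PORT": "9090"}

# Set in the environment: the environment value wins.
port = int(read_setting("EXAMPLE_SERVICE_PORT", "8080", env=fake_env))

# Not set in the environment: the default is used.
host = read_setting("EXAMPLE_SERVICE_HOST", "localhost", env=fake_env)
```

Passing the environment as a parameter keeps the sketch easy to test without touching the real process environment.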
5. UI and UX Improvements
- Workflow Workspace Enhancements: The workspace now supports dynamic name-field resizing, double-click to open operator results, auto-centering, broken-workflow alerts, and reliable refresh behavior (#3342, #3359, #3573, #3603, #3606, #3636).
- Result Panel Improvements: The result panel shows a compact statistics row, supports resizable panels, exports to HTML and Arrow, and auto-expands on click (#3429, #3435, #3568).
- Visual Design Updates: We introduced a new Apache Texera logo, clarified button styling, added an adaptive navigation menu, and improved port statistics displays (#3481, #3483, #3486, #3489, #3569, #3585, #3601, #3722).
- Dataset UI Improvements: The dataset UI shows version creation dates with timezone tooltips, displays upload speed and time estimates, updates progress in real time, and validates file sizes (#3426, #3427, #3612, #3662, #3702, #3706).
- Admin Dashboard Enhancements: The admin dashboard includes user activity metrics, a better comment interface, column layout refinements, and a live branding preview (#3436, #3464, #3594, #3625, #3700).
- Editor Experience: The editor shows operator descriptions in panels, supports dynamic sidebar customization, improves edge deletion and reconnection, and organizes the operator menu by category (#3349, #3364, #3411, #3437, #3479, #3533).
v1.0.0
Since 2016, our team has been developing the Texera system (https://texera.io/) to support cloud-based data science, AI, and ML using GUI-based workflows. Today (April 7, 2025), we are glad to make an official release of its first major version, v1.0.0!
Major Features
- Supporting low-code/no-code data science using workflows
- Parallel data-processing engine running on computing clusters
- Using the Apache Pekko actor-model system
- Supporting UDFs in Python, R, and Java
- Supporting ML training and inference
- Including a rich collection of ML operators
- Interactive workflow execution model that supports pausing and resuming
- Supporting collaborations with shared editing, shared execution, and version control
- Supporting debugging, including line-by-line debugging in Python UDFs
- Supporting reproducibility of data analysis
- Region-by-region execution with full pipelining in each region
- Storing execution results using Apache Iceberg
- Supporting version-controlled file collections on S3-compatible storage managed by LakeFS
- Adopting a microservice-based architecture using Kubernetes and Docker
- Supporting computing isolation and storage isolation of multiple tenants
For users and developers
Texera first beta release 0.1.0
With the renaming process finished, we are publishing the first release of Texera (version 0.1.0).