Releases: apache/texera
Apache Texera (incubating) Release v1.1.0-incubating
What's Changed
- feat(auth): removed token refresh functionality by @Sentiaus in #3765
- feat(access-control-service): AccessControlService added as a new micro service by @aicam in #3767
- fix: Persist & apply “Data Transfer Batch Size” from Workflow Settings by @mengw15 in #3770
- fix(op): fix incorrect names for Machine Learning Operators by @aglinxinyuan in #3776
- feat(op): add ML training operators for linear and logistic regression by @yunyad in #3779
- feat: cap number of concurrent file uploads by @xuang7 in #3735
- fix: Correctly format workflow JSON when downloading from hub by @colinthebomb1 in #3751
- fix(ui): Prevent link delete buttons from overlapping by @Ma77Ball in #3602
- chore: Add publish section with whoami to .asf.yaml by @pjfanning in #3785
- fix(gui): reopen workflow websocket when switching workflows by @seongjinyoon in #3773
- fix(ui): move quota diagrams to new tab by @gracecluvohio in #3781
- fix(ci): add explicit jdbc driver loading in relevant e2e tests by @Xiao-zhen-Liu in #3783
- fix: pin R arrow package to version 14.0.2.1 in computing-unit-master by @bobbai00 in #3761
- chore(deps): bump torch from 2.4.1 to 2.8.0 and typing_extensions from 4.9.0 to 4.10.0 by @mengw15 in #3800
- chore(deps): bump scikit-learn from 1.4.0 to 1.5.0 in /core/amber by @dependabot[bot] in #3799
- chore: refactor user avatar component by @aglinxinyuan in #3790
- chore(deps): bump axios from 1.7.7 to 1.12.2 in /core/gui by @dependabot[bot] in #3797
- chore(deps): bump form-data from 3.0.2 to 3.0.4 in /core/gui by @dependabot[bot] in #3793
- fix(debugger): do not truncate debug messages by @Yicong-Huang in #3786
- feat(config): allow default.conf to be set using environment variables by @bobbai00 in #3757
- feat: display datasets version creation timestamp by @xuang7 in #3803
- chore: Upgrade checkout and setup actions to latest versions by @pjfanning in #3808
- feat: add save button for sidebar tabs on the admin settings page by @xuang7 in #3801
- feat(dataset): allow datasets to have the same name across different users by @AnzhiZhang in #3715
- ci: automatically label PRs by @Yicong-Huang in #3536
- ci: fix labeler.yml glob pattern syntax by @Yicong-Huang in #3813
- ci: fix labeler configuration for file glob patterns by @Yicong-Huang in #3814
- feat(gui): display the owner's avatar next to each CU by @yunyad in #3784
- fix(ci): explicitly load jdbc driver when accessing iceberg postgres catalog by @Xiao-zhen-Liu in #3817
- fix(ci): narrow backend file patterns in labeler.yml by @Yicong-Huang in #3815
- fix: delete button only clickable from outer ring by @Ma77Ball in #3809
- chore: add bug report template by @Yicong-Huang in #3812
- chore(deps): bump @babel/helpers from 7.25.7 to 7.28.4 in /core/gui by @dependabot[bot] in #3795
- fix: conditionally show upload panels based on upload activity by @xuang7 in #3818
- fix(image): exec format error in pylsp Docker container by @bobbai00 in #3811
- fix(gui): keep words complete in the workspace's comment boxes by @colinthebomb1 in #3597
- chore: Improve Bug Issue Template by @kunwp1 in #3826
- chore(deps): bump transformers from 4.44.2 to 4.53.0 in /core/amber by @dependabot[bot] in #3635
- fix(deployment): deduplicate SQL files under `deployment` by using symlinks by @bobbai00 in #3824
- chore(ci): automatically assign an author of a PR by @Yicong-Huang in #3835
- feat(amber): enable user system by default by @Xiao-zhen-Liu in #3782
- chore(deps): bump prismjs from 1.29.0 to 1.30.0 in /core/gui by @dependabot[bot] in #3796
- fix(access-control-service): fix formatting of `AccessControlResource` by @aicam in #3838
- chore(ci): revert automatically assign an author of a PR by @Yicong-Huang in #3839
- fix(access-control-service): fix service file formats by @aicam in #3840
- feat(access-control-service): add `access-control-service` to authorize the requests to `/wsapi` and `Computing Unit` endpoints by @aicam in #3598
- fix(file-service): avoid manually constructing the Dataset object through parameters by @bobbai00 in #3820
- refactor(config): remove user-sys enabled flag by @Xiao-zhen-Liu in #3831
- fix(gui): enable frontend undo-redo with bug-fix for shared editing by @Xiao-zhen-Liu in #3836
- feat: enable canceling pending uploads before upload begins by @xuang7 in #3847
- feat: improve `file-service` Docker Compose file by @AnzhiZhang in #3834
- feat: prevent export of results from non-downloadable datasets by @seongjinyoon in #3772
- feat: switch workflow result downloads to use browser native downloads by @madisonmlin in #3728
- fix(ui): remove duplicate left sidebar border and extend to bottom of page by @gracecluvohio in #3849
- chore: rename packages to org.apache by @aglinxinyuan in #3848
- chore: relocate core/gui to frontend by @Yicong-Huang in #3851
- chore: relocate root project build out from core by @Yicong-Huang in #3852
- chore: relocate access-control-service by @Yicong-Huang in #3866
- chore: relocate computing-unit-managing-service by @Yicong-Huang in #3867
- fix: Clear Python executable path in udf.conf by @aglinxinyuan in #3869
- fix: use `TEXERA_HOME` or the current directory as the default working directory by @Yicong-Huang in #3870
- fix: use frontend as the logo path by @Yicong-Huang in #3876
- chore: relocate workflow-compiling-service by @Yicong-Huang in #3868
- chore: relocate file-service by @Yicong-Huang in #3873
- chore: relocate config-service by @Yicong-Huang in #3874
- chore: relocate `core/pyright-language-server` and rename it to `pyright-language-service` by @Yicong-Huang in #3875
- chore: rename `core/scripts` to `bin` by @Yicong-Huang in #3877
- chore: relocate amber by @Yicong-Huang in #3878
- chore: remove `core/log` by @Yicong-Huang in #3881
- chore: move `deployment` to `bin/deployment` by @Yicong-Huang in #3883
- chore: rename `core` to `common` by @Yicong-Huang in #3882
- chore: move `bin/sql` to `sql` by @Yicong-Huang in #3884
- fix: use `TEXERA_HOME` for jooq by @Yicong-Huang in #3886
- fix: move `.scalafix.conf` out to root by @Yicong-Huang in #3888
- fix: use `logs/` for all services by @Yicong-Huang in #3887
- fix: remove a duplicated python-language-service.sh by @Yicong-Huang in #3889
- chore(deps): bump brace-expansion from 1.1.11 to 1.1.12 in /core/gui by @dependabot[bot] in #3798
- refactor(frontend): Simplify onClickToggleGrids logic by @aglinxinyuan in #3896
- fix: restructure bin/deployment and update Dockerfiles for new repository structure by @bobbai00 in #3892
- feat(frontend): introduce dropdown menu for layers by @aglinxinyuan in #3898
- feat(bin): add some helper scripts by @Yicong-Huang in #3890
- fix(deployment): add image tag for MinIO image in helm chart by @aicam in #3903
- chore(doc): create pull request template for contributors by @Yicong-Huang in #3899
- feat(op): add Substring Sear...
Apache Texera (Incubating) Release v1.1.0
Release Highlights
1. Feature Upgrades
- Advanced Computing Unit Feature: We refined the Kubernetes-based computing unit layer by improving the UI/UX and enabling shared-memory control, GPU support and JVM memory tuning (#3337, #3363, #3371, #3372, #3400, #3451, #3591, #3604).
- Dynamic Admin Configuration System: Site settings are now stored in the database, and admins can customize logos and favicons with a live preview of branding changes (#3453, #3480, #3700, #3743).
2. Operators
- New Visualization Operators: We added Choropleth Map, Bullet Chart, Dendrogram, Violin Plot, Volcano Plot, Nested Table, Range Slider, and Time Series Plot operators (#3352, #3356, #3362, #3374, #3397, #3418, #3531, #3744).
- Machine Learning Operators: We introduced a suite of scikit-learn training operators (#3535).
- Utility Operators: We added a Sleep operator, enhanced the Generator with examples, and improved the Projection operator with column re-ordering (#3537, #3543, #3661).
- Enhanced Visualization System: We organized operators with a new category hierarchy, improved grouping, and first-level category display (#3364, #3369).
3. Authentication & Security
- Admin Account Management: Admin accounts can be auto-registered from configuration, and admins can leave comments to users (#3266, #3387, #3625).
- Security Enhancements: The system tracks user activity with last-login times, separates JWT expiration from activity windows, and sends email alerts for unauthorized access attempts (#3391, #3393, #3541, #3693, #3694, #3703, #3731).
- Access Control Improvements: We refined JWT secret access logic, optimized token refresh behavior, and added login confirmation requirements (#3357, #3703).
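The idea of separating JWT expiration from the activity window, mentioned above, can be illustrated with a minimal sketch. This is not Texera's implementation: the `Session` record, the `is_session_valid` function, and the 30-minute window are hypothetical names and values chosen for illustration. It only shows the general pattern of checking a hard token lifetime and a sliding inactivity limit as two independent conditions.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class Session:
    """Hypothetical session record; not Texera's actual data model."""
    token_expires_at: datetime  # hard limit carried by the token itself
    last_active_at: datetime    # refreshed whenever the user does something


def is_session_valid(session: Session, now: datetime,
                     activity_window: timedelta) -> bool:
    """Accept a request only if both independent checks pass."""
    token_alive = now < session.token_expires_at
    recently_active = now - session.last_active_at <= activity_window
    return token_alive and recently_active


# Fixed timestamps keep the example deterministic.
now = datetime(2025, 1, 1, 12, 0, tzinfo=timezone.utc)
window = timedelta(minutes=30)  # assumed inactivity limit

active = Session(token_expires_at=now + timedelta(hours=1),
                 last_active_at=now - timedelta(minutes=5))
idle = Session(token_expires_at=now + timedelta(hours=1),
               last_active_at=now - timedelta(hours=2))
expired = Session(token_expires_at=now - timedelta(minutes=1),
                  last_active_at=now - timedelta(minutes=1))

print(is_session_valid(active, now, window))   # True
print(is_session_valid(idle, now, window))     # False: token alive, user idle
print(is_session_valid(expired, now, window))  # False: token expired
```

Keeping the two checks separate means the inactivity limit can be tuned without reissuing tokens, and a long-lived token does not by itself keep an idle session open.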
4. Infrastructure & Deployment
- Apache Foundation Compliance: We added ASF headers, NOTICE and DISCLAIMER files, integrated Apache Rat checks in CI, and updated license headers (#3388, #3414, #3415, #3631, #3632, #3637).
- Kubernetes and Helm Enhancements: We added GCP deployment support, a DaemonSet for image pre-pulling, ingress class configuration, and improved job ordering (#3358, #3366, #3376, #3386, #3419, #3652).
- Build System Improvements: We automated jOOQ code generation in SBT, removed jOOQ from CI, and migrated builds to a JDK to support Java UDFs (#3628, #3670, #3689, #3746, #3747).
- Configuration Management: We centralized configuration in a dedicated service, added environment-variable defaults, and stored site settings in the database (#3458, #3466, #3622).
- Centralized Config Service: We introduced a centralized config-service with database-backed dynamic settings, enabling admins to modify branding, upload limits, authentication parameters, and system configurations through a web dashboard without service restarts or redeployment (#3453, #3458, #3466, #3480, #3511, #3522, #3533, #3622, #3664, #3693, #3700, #3743).
- Dependency Updates: We upgraded PyArrow from 19.0.1 to 21.0.0, updated jOOQ to 3.16.23, and updated Formly to 6.3.12 (#3368, #3565, #3600, #3687).
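As a rough illustration of the environment-variable defaults mentioned under Configuration Management, the sketch below resolves a setting from the environment and falls back to a built-in default. The setting names (`MAX_CONCURRENT_UPLOADS`, `LOG_DIR`), their values, and the `resolve_setting` helper are assumptions made for this example and are not taken from Texera's configuration files.

```python
import os

# Hypothetical defaults; the keys and values are illustrative only.
DEFAULTS = {
    "MAX_CONCURRENT_UPLOADS": "3",
    "LOG_DIR": "logs/",
}


def resolve_setting(name, env=None):
    """Return the environment override for `name`, else the built-in default."""
    env = os.environ if env is None else env
    return env.get(name, DEFAULTS[name])


# Passing an explicit mapping keeps the example independent of the real
# environment; omit `env` to read from os.environ instead.
print(resolve_setting("MAX_CONCURRENT_UPLOADS", env={}))
print(resolve_setting("MAX_CONCURRENT_UPLOADS",
                      env={"MAX_CONCURRENT_UPLOADS": "8"}))
```

With an empty environment the first call falls back to the default `"3"`; the second call returns the override `"8"`.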
5. UI and UX Improvements
- Workflow Workspace Enhancements: The workspace now supports dynamic name-field resizing, double-click to open operator results, auto-centering, broken-workflow alerts, and reliable refresh behavior (#3342, #3359, #3573, #3603, #3606, #3636).
- Result Panel Improvements: The result panel shows a compact statistics row, supports resizable panels, exports to HTML and Arrow, and auto-expands on click (#3429, #3435, #3568).
- Visual Design Updates: We introduced a new Apache Texera logo, clarified button styling, added an adaptive navigation menu, and improved port statistics displays (#3481, #3483, #3486, #3489, #3569, #3585, #3601, #3722).
- Dataset UI Improvements: The dataset UI shows version creation dates with timezone tooltips, displays upload speed and time estimates, updates progress in real time, and validates file sizes (#3426, #3427, #3612, #3662, #3702, #3706).
- Admin Dashboard Enhancements: The admin dashboard includes user activity metrics, a better comment interface, column layout refinements, and a live branding preview (#3436, #3464, #3594, #3625, #3700).
- Editor Experience: The editor shows operator descriptions in panels, supports dynamic sidebar customization, improves edge deletion and reconnection, and organizes the operator menu by category (#3349, #3364, #3411, #3437, #3479, #3533).
v1.0.0
Since 2016, our team has been developing the Texera system (https://texera.io/) to support cloud-based data science, AI, and ML using GUI-based workflows. Today (April 7, 2025), we are glad to make an official release of its first major version, v1.0.0!
Major Features
- Supporting low/no coding data science using workflows
- Parallel data-processing engine running on computing clusters
- Using the Apache Pekko actor-model system
- Supporting UDFs in Python, R, and Java
- Supporting ML training and inference
- Including a rich collection of ML operators
- Interactive workflow execution model that supports pausing and resuming
- Supporting collaborations with shared editing, shared execution, and version control
- Supporting debugging, including line-by-line debugging in Python UDFs
- Supporting reproducibility of data analysis
- Region-by-region execution with full pipelining in each region
- Storing execution results using Apache Iceberg
- Supporting version-controlled file collections on S3-compatible storage managed by LakeFS
- Adopting a microservice-based architecture using Kubernetes and Docker
- Supporting computing isolation and storage isolation of multiple tenants
For users and developers
Texera first beta release 0.1.0
With the renaming process finished, we are publishing the first release of Texera (version 0.1.0).