-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Insights: facebookincubator/velox
Overview
-
- 0 Merged pull requests
- 38 Open pull requests
- 14 Closed issues
- 22 New issues
Could not load contribution data
Please try again later
38 Pull requests opened by 26 people
-
Support Spark function str_to_map when map key duplicate Policy is LAST_WIN
#12317 opened
Feb 13, 2025 -
refactor: Setup scripts
#12321 opened
Feb 13, 2025 -
fix(docs): Small fixes for the Vectors doc
#12322 opened
Feb 13, 2025 -
feat(type): Support TDigestType in Presto to Velox Query Plan (#24546)
#12326 opened
Feb 13, 2025 -
refactor(parquet): Use velox parquet writer in Metadatatest
#12329 opened
Feb 13, 2025 -
fix: Fix NaN values in Spark collect_set aggregate function
#12335 opened
Feb 14, 2025 -
fix: Reduce fuzzer expression depth due to bias run failure
#12337 opened
Feb 14, 2025 -
refactor(spill): Only use query config to decide prefix sort enable status
#12340 opened
Feb 14, 2025 -
docs(blog): Add Velox Primer Part1 Post
#12348 opened
Feb 15, 2025 -
fix(parquet): Avoid SEGV if table column type does not match file column type
#12350 opened
Feb 15, 2025 -
Fix unused-value issue in velox/dwio/dwrf/test/WriterContextTest.cpp +1
#12351 opened
Feb 15, 2025 -
feat: Add Spark date_trunc and trunc function
#12352 opened
Feb 15, 2025 -
Fix unused-value issue in velox/dwio/dwrf/test/WriterContextTest.cpp +2
#12353 opened
Feb 16, 2025 -
feat: Support to specify partition key order in TableWrite operator
#12355 opened
Feb 17, 2025 -
fix: Remove operator== and hash in DwrfStreamIdentifier
#12356 opened
Feb 17, 2025 -
feat(parquet): enable collect statistic for int96
#12357 opened
Feb 17, 2025 -
fix: Fix dual linking of gflags when CMake variable `BUILD_SHARED_LIBS` is cached
#12359 opened
Feb 17, 2025 -
fix: Reduce lock cost in AsciiInfo updates
#12363 opened
Feb 17, 2025 -
feat: Add HashStringAllocator::InputStream
#12364 opened
Feb 17, 2025 -
add mysql compatible function bit_length
#12366 opened
Feb 18, 2025 -
feat(fuzzer): Add input generator for json_extract in expression fuzzer
#12367 opened
Feb 18, 2025 -
fix: Fix partition filters with timestamp value
#12368 opened
Feb 18, 2025 -
feat: Add support for Timestamp to Integral for Spark
#12369 opened
Feb 18, 2025 -
fix: Fix approx_set to use murmur3 hash function to match Presto
#12374 opened
Feb 18, 2025 -
feat: Handle errors during constant folding in expression compilation
#12376 opened
Feb 18, 2025 -
refactor: Remove unneeded includes
#12378 opened
Feb 18, 2025 -
feat: Add config to throw exception for duplicate keys in Spark map_concat function
#12379 opened
Feb 18, 2025 -
feat: Remove more unnecessary PTX code from Wave
#12380 opened
Feb 18, 2025 -
misc(fuzzer): Add stats about verification against reference DB in expression fuzzer
#12383 opened
Feb 19, 2025 -
fix: Fix NullHandlingMode for Spark min/max agg function
#12384 opened
Feb 19, 2025 -
refactor(fuzzer): Unlink CacheFuzzer from GTest
#12386 opened
Feb 19, 2025 -
misc(fuzzer): remove verifyWindow() in AggregationFuzzer
#12391 opened
Feb 20, 2025 -
feat(fuzzer): Allow functions to be tested only with sorted input in aggregation fuzzer
#12392 opened
Feb 20, 2025 -
refactor: Break up Presto custom types into Declaration and Registration files
#12393 opened
Feb 20, 2025 -
fix: Fix grouping key reordering during spilling
#12395 opened
Feb 20, 2025 -
feat: Add max(varchar, bigint) support
#12396 opened
Feb 20, 2025 -
misc: Remove the unnecessary merge in addSingleGroupRawInput of VarianceAggregate
#12397 opened
Feb 20, 2025 -
refactor(fuzzer): Unlink JoinFuzzer from GTest.
#12399 opened
Feb 20, 2025
14 Issues closed by 5 people
-
map_union_sum: Velox does not accept boolean type while Presto does
#12382 closed
Feb 19, 2025 -
CI: TableScanReplayerTest fails with "Memory pool TableScan_replayer_0 already exists"
#12360 closed
Feb 19, 2025 -
Unsupported filter in parquet reader: Filter(MultiRange, deterministic, null not allowed)
#908 closed
Feb 18, 2025 -
SIGSEGV Aws::Utils::Crypto::CleanupCrypto
#4562 closed
Feb 18, 2025 -
Bundled folly installation needs to link with [email protected]
#5309 closed
Feb 18, 2025 -
Build Folly with FOLLY_HAVE_INT128_T ON
#4864 closed
Feb 18, 2025 -
TpchBenchmark is broken
#6834 closed
Feb 18, 2025 -
Add TableScanBuilder in PlanBuilder
#7391 closed
Feb 18, 2025 -
Update CircleCI image to include DuckDB build
#7697 closed
Feb 18, 2025 -
Fix flaky S3FileSystemRegistrationTest.fileHandle
#8488 closed
Feb 18, 2025 -
Add support for max-connections option in S3FileSystem
#9436 closed
Feb 18, 2025 -
Hardcoded random seed in unit test ReaderTest.projectColumnsMutation
#12240 closed
Feb 18, 2025 -
Build failure with the GCC/Ubuntu debug job in CI
#12100 closed
Feb 18, 2025 -
Diff on parquet filter agg
#11257 closed
Feb 15, 2025
22 Issues opened by 7 people
-
Unlink JoinFuzzer from GTest
#12398 opened
Feb 20, 2025 -
map_union_sum: varbinary not accepted as a key type
#12390 opened
Feb 19, 2025 -
SIGABRT from the JsonParseFunction::sortIndices
#12389 opened
Feb 19, 2025 -
Unlink CacheFuzzer from GTest
#12375 opened
Feb 18, 2025 -
to_ieee754_64: value mismatch propagates to sha512 fuzzer failure
#12372 opened
Feb 18, 2025 -
json_parse: values mismatch between Velox/Presto
#12371 opened
Feb 18, 2025 -
Crash due to dual linking of gflags
#12358 opened
Feb 17, 2025 -
SEGV in reading parquet data if column type mismatch occurs
#12349 opened
Feb 15, 2025 -
wilson_interval_upper: value mismatch Presto is NaN
#12347 opened
Feb 14, 2025 -
beta_cdf: Presto throws while Velox returns 0.0
#12346 opened
Feb 14, 2025 -
to_ieee754_32: mismatch value between Presto/Velox
#12345 opened
Feb 14, 2025 -
json_extract: Velox throws while Presto returns NULL
#12344 opened
Feb 14, 2025 -
rpad: value mismatch between Presto and Velox
#12343 opened
Feb 14, 2025 -
Compilation failure on dwio/dwrf/common/Common.h
#12342 opened
Feb 14, 2025 -
word_stem: Presto vs Velox value mismatch
#12341 opened
Feb 14, 2025 -
bitwise_right_shift: mismatch values Presto/Velox
#12339 opened
Feb 14, 2025 -
log2: Presto vs Velox values differ
#12338 opened
Feb 14, 2025 -
Unexpected result in the Spark collect_set aggregate function
#12336 opened
Feb 14, 2025 -
bitwise_left_shift: Value mismatch during fuzzer run with nested call affecting upstream function call
#12330 opened
Feb 14, 2025 -
chi_squared_cdf: Velox doesn't accept infinity as input
#12327 opened
Feb 13, 2025 -
url_extract_protocol: value mismatch Velox produces output whilst Presto produces NULL
#12325 opened
Feb 13, 2025 -
url_extract_fragment: Velox produces value "" instead of NULL
#12324 opened
Feb 13, 2025
65 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
feat(s3fs): Add Metrics
#12213 commented on
Feb 19, 2025 • 40 new comments -
feat(functions): Add support for REST based remote functions
#10911 commented on
Feb 20, 2025 • 17 new comments -
feat: Add Spark to_json function
#11995 commented on
Feb 18, 2025 • 15 new comments -
fix: Use dequeue to track WindowPartitions in RowStreamingWindowBuild
#11077 commented on
Feb 20, 2025 • 15 new comments -
feat: Add Spark months_between function
#12110 commented on
Feb 15, 2025 • 14 new comments -
feat: Add Spark from_json function
#11709 commented on
Feb 17, 2025 • 11 new comments -
feat: Add Spark get_struct_field function
#12166 commented on
Feb 20, 2025 • 10 new comments -
refactor: Remove unused headers and gflags
#12248 commented on
Feb 19, 2025 • 6 new comments -
feat: Support Spark ArraySort with lambda function
#10138 commented on
Feb 20, 2025 • 6 new comments -
feat: Add JSON SerDe code for member field HiveConnectorSplit::bucketConversion
#12178 commented on
Feb 20, 2025 • 5 new comments -
fix(hashjoin): Create new VectorHashers for listNullKeyRows to prevent dangling pointer access
#12106 commented on
Feb 20, 2025 • 4 new comments -
docs: Adding async data cache configs and ssd cache documentation
#11429 commented on
Feb 19, 2025 • 4 new comments -
feat: Add single node mode for query trace tool
#12171 commented on
Feb 19, 2025 • 4 new comments -
feat: Add GEOS library as an optional dependency
#12243 commented on
Feb 18, 2025 • 4 new comments -
feat: Add Spark CAST(double/float as timestamp)
#12041 commented on
Feb 19, 2025 • 4 new comments -
feat: Add Global Config in place of gflags for ssd
#12181 commented on
Feb 20, 2025 • 3 new comments -
feat: Add support for leftSemiProject join in nested loop join
#12172 commented on
Feb 18, 2025 • 3 new comments -
feat(fuzzer): Add TopNRowNumberFuzzer
#12103 commented on
Feb 20, 2025 • 3 new comments -
feat(parquet): Add boolean rle decoder for Parquet
#11282 commented on
Feb 17, 2025 • 2 new comments -
fix(sparksql): Respect session timezone when casting varchar to timestamp
#11975 commented on
Feb 19, 2025 • 2 new comments -
feat: Add spark varchar_writeside_check function for writing to fixed length varchar type
#12125 commented on
Feb 18, 2025 • 2 new comments -
feat(iceberg): Add support for writing iceberg tables
#10996 commented on
Feb 14, 2025 • 1 new comment -
feat: Support timestamp and date types for Spark unix_timestamp function
#11128 commented on
Feb 19, 2025 • 1 new comment -
feat: Add Spark CAST(timestamp as integral)
#11468 commented on
Feb 19, 2025 • 1 new comment -
Refactor: handle in-output errors for string functions (Part 2)
#12064 commented on
Feb 16, 2025 • 0 new comments -
feat: Support Spark explode outer
#11954 commented on
Feb 17, 2025 • 0 new comments -
feat: Add map_from_entries Spark function
#11934 commented on
Feb 14, 2025 • 0 new comments -
Refactor: handle in-output errors for string functions (Part 1)
#11854 commented on
Feb 16, 2025 • 0 new comments -
misc(build): support build on openeuler 22.03 LTS and 24.03 LTS
#12161 commented on
Feb 18, 2025 • 0 new comments -
feat: Add support for double to timestamp cast for Spark
#12230 commented on
Feb 15, 2025 • 0 new comments -
feat(aggregate): Add complex type to map_union_sum
#12268 commented on
Feb 19, 2025 • 0 new comments -
feat: Add Geometry Presto type
#12274 commented on
Feb 14, 2025 • 0 new comments -
refactor: Minor cleanups for `variant::serialize`
#12275 commented on
Feb 20, 2025 • 0 new comments -
refactor: Update columnStats to return non-optional result
#12293 commented on
Feb 17, 2025 • 0 new comments -
fix: regexp_split fails in empty match pattern
#12305 commented on
Feb 18, 2025 • 0 new comments -
misc(fuzzer): Add custom data generation for JSON path
#12312 commented on
Feb 18, 2025 • 0 new comments -
feat: Add Velox implementation for tdigest_agg
#12315 commented on
Feb 18, 2025 • 0 new comments -
Use Velox Parquet Writer in the Parquet Writer tests
#12194 commented on
Feb 13, 2025 • 0 new comments -
Need to support offline compilation of xsimd/cpr dependencies
#8307 commented on
Feb 18, 2025 • 0 new comments -
Cache fuzzer is flaky in CI
#12167 commented on
Feb 18, 2025 • 0 new comments -
Optimize Broadcast Hash Join
#12107 commented on
Feb 19, 2025 • 0 new comments -
Spark fuzzer failure: Ordering nulls is not supported
#11601 commented on
Feb 19, 2025 • 0 new comments -
Possible flaky test MultiFragmentTest.abortMergeExchange
#11736 commented on
Feb 19, 2025 • 0 new comments -
NestedLoopJoin is significantly slower than vanilla spark
#12294 commented on
Feb 19, 2025 • 0 new comments -
feat(parquet): Support struct schema evolution matching by name
#5962 commented on
Feb 18, 2025 • 0 new comments -
misc: Enable Spark query runner as reference in aggregation fuzzer test
#9559 commented on
Feb 14, 2025 • 0 new comments -
Support semi projection join type in smj
#10456 commented on
Feb 20, 2025 • 0 new comments -
feat(type): Add VarcharN and VarbinaryN type support
#10727 commented on
Feb 13, 2025 • 0 new comments -
Fix shortDecimal test cases for Parquet.
#10894 commented on
Feb 20, 2025 • 0 new comments -
build(deps): Upgrade DuckDB dependency to 1.1.0
#10982 commented on
Feb 17, 2025 • 0 new comments -
Fix full outer result mismatch issue when output contains multiple matching rows
#11068 commented on
Feb 20, 2025 • 0 new comments -
feat: Add Spark date_trunc function
#11340 commented on
Feb 19, 2025 • 0 new comments -
misc: Improve OutputBufferManager initialization
#11350 commented on
Feb 18, 2025 • 0 new comments -
Add a new setup script for Fedora Linux (40)
#11352 commented on
Feb 20, 2025 • 0 new comments -
refactor(exchange): Rename taskId to remoteTaskId
#11424 commented on
Feb 19, 2025 • 0 new comments -
Fix JSON parser to allow control characters in JSON string input
#11433 commented on
Feb 18, 2025 • 0 new comments -
fix: Disable Thread Sanitization for an E2E Test that Times Out
#11596 commented on
Feb 20, 2025 • 0 new comments -
Bump cross-spawn from 7.0.3 to 7.0.6 in /website
#11606 commented on
Feb 20, 2025 • 0 new comments -
fix(iceberg): Positional delete bug
#11614 commented on
Feb 20, 2025 • 0 new comments -
feat: Combine low selectivity vectors generated by the hash join filter
#11739 commented on
Feb 17, 2025 • 0 new comments -
fix: Fix smj result mismatch issue in semi, anit and full outer join
#11771 commented on
Feb 17, 2025 • 0 new comments -
fix: Fix the MergeSource data lost issue
#11772 commented on
Feb 17, 2025 • 0 new comments -
feat: Add ip_prefix_collapse functions
#11778 commented on
Feb 14, 2025 • 0 new comments -
fix: Improve macro safety and casting practices by removing trailing semicolon
#11815 commented on
Feb 16, 2025 • 0 new comments -
feat: Optimize serializer decompress buffer for BufferInputStream
#11836 commented on
Feb 18, 2025 • 0 new comments