Source: apache-arrow
Section: devel
Priority: optional
Maintainer: Debian Science Maintainers <debian-science-maintainers@alioth-lists.debian.net>
Uploaders:
  Steffen Moeller <moeller@debian.org>,
  Sascha Steinbiss <satta@debian.org>,
  Diane Trout <diane@ghic.org>,
  Dominique Belhachemi <domibel@debian.org>,
  Rene Engelhard <rene@debian.org>
Build-Depends:
# Restrict to 64-bit little-endian architectures: upstream does not yet support others.
  architecture-is-64-bit,
  architecture-is-little-endian,
#
  apache-arrow-testing <!nocheck>,
  apache-parquet-testing <!nocheck>,
  clang,
  cmake,
  cython3 (>= 3.1),
  debhelper-compat (= 13),
  dh-python,
  dh-r,
  dh-sequence-numpy3,
  git,
  libboost-dev,
  libbrotli-dev,
  libbz2-dev,
  libcurl4-openssl-dev,
  libgoogle-glog-dev,
  libgrpc++-dev,
  libmimalloc-dev,
  liblz4-dev,
  libre2-dev,
  libprotobuf-dev,
  libprotoc-dev,
  libsnappy-dev,
  libssl-dev,
  libthrift-dev,
  libutf8proc-dev,
  libxsimd-dev,
  libzstd-dev,
  llvm-dev,
  ninja-build,
  protobuf-compiler-grpc,
  python3-dev,
  python3-numpy,
  python3-setuptools-scm,
  python3-cffi <!nocheck>,
  python3-hypothesis <!nocheck>,
  python3-memory-profiler <!nocheck>,
  python3-pandas <!nocheck>,
  python3-pytest <!nocheck>,
  python3-pytest-arraydiff <!nocheck>,
  python3-pytest-cov <!nocheck>,
  python3-thrift <!nocheck>,
  r-base-dev,
  r-cran-assertthat,
  r-cran-bit64,
  r-cran-cpp11,
  r-cran-glue,
  r-cran-purrr,
  r-cran-r6,
  r-cran-rlang,
  r-cran-tidyselect,
  r-cran-vctrs,
  rapidjson-dev,
  zlib1g-dev
Standards-Version: 4.7.3
Vcs-Browser: https://salsa.debian.org/science-team/arrow
Vcs-Git: https://salsa.debian.org/science-team/arrow.git
Homepage: https://arrow.apache.org/
# To be synced with dev/tasks/linux-packages/apache-arrow/debian/control.in

Package: libarrow2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends: ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends}
Description: Apache Arrow data processing library
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ library files.

Package: libarrow-acero2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends: ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  libarrow2300 (= ${binary:Version}),
Description: Apache library for processing large streams of data
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ library files for the Apache Acero module.

Package: libarrow-dataset2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends: ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  libarrow2300 (= ${binary:Version}),
  libparquet2300 (= ${binary:Version})
Description: Apache Arrow is a data processing library for analysis
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ library files for Dataset module.

Package: libarrow-flight2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends: ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  libarrow2300 (= ${binary:Version})
Description: Apache Arrow is a data processing library for analysis
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ library files for Flight RPC system.


Package: libarrow-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow2300 (= ${binary:Version}),
  libbrotli-dev,
  libbz2-dev,
  libgrpc++-dev,
  liblz4-dev,
  libre2-dev,
  libsnappy-dev,
  libssl-dev,
  libutf8proc-dev,
  libzstd-dev,
  protobuf-compiler-grpc,
  libprotobuf-dev,
  libprotoc-dev,
  libthrift-dev,
  zlib1g-dev
Description: Apache Arrow data processing library - header files
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ header files.

Package: libarrow-acero-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow-dev (= ${binary:Version}),
  libarrow-acero2300 (= ${binary:Version}),
Description: Apache Acero is a tool for processing large streams of data
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ header files for the Acero module.

Package: libarrow-dataset-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow-dev (= ${binary:Version}),
  libarrow-dataset2300 (= ${binary:Version}),
  libparquet-dev (= ${binary:Version})
Description: Apache Arrow is a data processing library for analysis
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ header files for dataset module.

Package: libarrow-flight-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow-dev (= ${binary:Version}),
  libarrow-flight2300 (= ${binary:Version})
Description: Apache Arrow is a data processing library for analysis
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ header files for Flight RPC system.

Package: libarrow-flight-sql2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends: ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  libarrow-flight2300 (= ${binary:Version})
Description: Apache Arrow Flight SQL library
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ library files for the Flight SQL extension.

Package: libarrow-flight-sql-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow-flight-dev (= ${binary:Version}),
  libarrow-flight-sql2300 (= ${binary:Version})
Description: Apache Arrow Flight SQL library - development files
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ header files for the Flight SQL extension.

Package: libgandiva2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends:
  ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends}
Description: Runtime expression compiler for Apache Arrow records
 Gandiva is a runtime expression compiler that uses LLVM to generate
 efficient native code for compute on Arrow record batches. Among the
 transformations of Arrow data files, Gandiva only handles projections
 and filters.
 .
 Gandiva was designed to take advantage of the Arrow memory format and
 modern hardware. From the Arrow memory model, since Arrow arrays have
 separate buffers for values and validity bitmaps, values and their
 null status can often be processed independently, allowing for better
 instruction pipelining. On modern hardware, compiling expressions
 using LLVM allows the execution to be optimized to the local runtime
 environment and hardware, including available SIMD instructions. To
 reduce optimization overhead, many Gandiva functions are pre-compiled
 into LLVM IR (intermediate representation).

Package: libgandiva-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow-dev (= ${binary:Version}),
  libgandiva2300 (= ${binary:Version})
Description: header files for Apache Arrow Gandiva library
 Gandiva is a runtime expression compiler that uses LLVM to generate
 efficient native code for compute on Arrow record batches. Among the
 transformations of Arrow data files, Gandiva only handles projections
 and filters.
 .
 This package provides C++ header files.

Package: libparquet2300
Section: libs
Architecture: any
Multi-Arch: same
Pre-Depends: ${misc:Pre-Depends}
Depends:
  ${misc:Depends},
  ${shlibs:Depends}
Description: Apache Parquet is a columnar storage format
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ library files to process the Apache Parquet
 format.

Package: libparquet-dev
Section: libdevel
Architecture: any
Multi-Arch: same
Depends:
  ${misc:Depends},
  libarrow-dev (= ${binary:Version}),
  libparquet2300 (= ${binary:Version})
Description: Apache Parquet is a columnar storage format
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides C++ header files.

Package: python3-pyarrow
Section: python
Architecture: any
Multi-Arch: no
Breaks:
  libarrow-python2300 (<< 23.0.1-2),
  libarrow-python-flight2300 (<< 23.0.1-2)
Replaces:
  libarrow-python2300 (<< 23.0.1-2),
  libarrow-python-flight2300 (<< 23.0.1-2)
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  ${python3:Depends},
  libarrow2300 (= ${binary:Version})
Description: Python interface to the Arrow C++ libraries
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 This library provides a Python API for functionality provided by the
 Arrow C++ libraries, along with tools for Arrow integration and
 interoperability with pandas, NumPy, and other software in the Python
 ecosystem.

Package: pyarrow-dev
Section: python
Architecture: any
Multi-Arch: no
Breaks: python3-pyarrow (<< 23.0.1-2)
Replaces: python3-pyarrow (<< 23.0.1-2)
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  python3,
  libarrow-dev (= ${binary:Version}),
  python3-pyarrow (= ${binary:Version})
Description: Python interface to the Apache Arrow C++ libraries
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This provides C headers for interacting with the pyarrow interface to
 the Arrow C++ libraries.

Package: r-cran-arrow
Section: gnu-r
Architecture: any
Multi-Arch: no
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  ${R:Depends},
  r-api-4.0,
  r-cran-assertthat,
  r-cran-bit64,
  r-cran-cpp11,
  r-cran-glue,
  r-cran-purrr,
  r-cran-r6,
  r-cran-rlang,
  r-cran-tidyselect,
  r-cran-vctrs
Description: R package for Apache Arrow data storage management
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 Arrow's libraries implement the format and provide building blocks
 for a range of use cases, including high performance analytics. Many
 popular projects use Arrow to ship columnar data efficiently or as the
 basis for analytic engines.
 .
 This package provides the R interface to arrow. It is the same as
 published independently on CRAN.

Package: arrow-tools
Section: utils
Architecture: any
Multi-Arch: foreign
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  libarrow2300 (= ${binary:Version})
Description: Apache Arrow IPC format conversion utilities
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 This package provides command-line utilities for converting between
 Arrow IPC file and stream formats: arrow-file-to-stream and
 arrow-stream-to-file.

Package: parquet-tools
Section: utils
Architecture: any
Multi-Arch: foreign
Depends:
  ${misc:Depends},
  ${shlibs:Depends},
  libparquet2300 (= ${binary:Version})
Description: Apache Parquet file inspection utilities
 Apache Arrow defines a language-independent columnar memory format
 for flat and nested data, organized for efficient analytic operations
 on modern hardware like CPUs and GPUs. The Arrow memory format also
 supports zero-copy reads for lightning-fast data access without
 serialization overhead.
 .
 This package provides command-line utilities for inspecting Parquet files:
 parquet-reader, parquet-dump-schema, parquet-dump-footer,
 parquet-dump-arrow-statistics, parquet-scan.


