Commit

Export libcudf env for long string support
Signed-off-by: Ayush Dattagupta <[email protected]>
ayushdg committed Jul 10, 2024
1 parent ba7e8a1 commit 19777e7
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions nemo_curator/__init__.py
@@ -12,6 +12,7 @@
# See the License for the specific language governing permissions and
# limitations under the License.

import os
import sys

import dask
@@ -48,3 +49,6 @@
# See https://github.com/NVIDIA/NeMo-Curator/issues/33
# This also happens when reading and writing to files
dask.config.set({"dataframe.convert-string": False})

# Enable libcudf large string support
os.environ["LIBCUDF_LARGE_STRINGS_ENABLED"] = "1"
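
The assignment above runs unconditionally when `nemo_curator` is imported, so it overwrites any value the user already exported. A minimal sketch of the same idea with an opt-out follows. The helper name `enable_large_strings` and its `override` parameter are hypothetical and not part of NeMo Curator; the sketch also assumes libcudf reads the variable from the process environment, so it should be set before any cuDF work starts.

```python
import os

# Environment variable name taken from the commit.
LARGE_STRINGS_VAR = "LIBCUDF_LARGE_STRINGS_ENABLED"


def enable_large_strings(override: bool = True) -> str:
    """Set the libcudf large-strings flag for the current process.

    Hypothetical helper for illustration only.
    override=True mirrors the commit: always write "1".
    override=False keeps a value the user has already exported.
    Returns the value that ends up in the environment.
    """
    if override:
        os.environ[LARGE_STRINGS_VAR] = "1"
    else:
        os.environ.setdefault(LARGE_STRINGS_VAR, "1")
    return os.environ[LARGE_STRINGS_VAR]
```

With `override=False`, a user who exported `LIBCUDF_LARGE_STRINGS_ENABLED=0` before starting Python keeps that setting; with the default, the behaviour matches the committed line.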
