Skip to content

Commit

Permalink
Export libcudf env for long string support
Browse files Browse the repository at this point in the history
Signed-off-by: Ayush Dattagupta <[email protected]>
  • Loading branch information
ayushdg committed May 21, 2024
1 parent fb63641 commit 3849f25
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions nemo_curator/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,8 @@
# See the License for the specific language governing permissions and
# limitations under the License.

import os

import dask

from .modules import *
Expand All @@ -21,3 +23,6 @@
# See https://github.com/NVIDIA/NeMo-Curator/issues/33
# This also happens when reading and writing to files
dask.config.set({"dataframe.convert-string": False})

# Enable libcudf large string support
os.environ["LIBCUDF_LARGE_STRINGS_ENABLED"] = "1"

0 comments on commit 3849f25

Please sign in to comment.