How could you add parallelism to make the encoding faster? #7
Comments
Good points. How are you benchmarking these times today? There is some overhead with the pyo3 code as it stands, and I hope we can optimize that away once openai/tiktoken#40 and openai/tiktoken#50 land.
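One minimal way to time the Rust side in isolation, which would exclude the pyo3 call overhead mentioned above, is sketched here; `encode` and `sample.txt` are hypothetical placeholders for the crate's real entry point and test input:

```rust
use std::time::Instant;

// Hypothetical stand-in for the crate's real encoding entry point.
fn encode(text: &str) -> Vec<u32> {
    text.bytes().map(u32::from).collect()
}

fn main() {
    let text = std::fs::read_to_string("sample.txt").expect("read sample input");

    // Warm up once so one-time setup (vocab loading, regex compilation)
    // is not counted in the measurement.
    let _ = encode(&text);

    let start = Instant::now();
    let tokens = encode(&text);
    println!("encoded {} tokens in {:?}", tokens.len(), start.elapsed());
}
```

Measuring a release build (`cargo run --release`) matters here: an unoptimized debug build on its own can account for a slowdown of the size reported in this issue.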
I'd like to contribute to this issue. Is this project still active?
Issue is open and the project is active! :) Happy to advise / review any PRs.
Sweet! Besides this issue, any other notable issues/enhancements to work on? Gonna take a closer look tomorrow.
Great to hear! Issues that can be worked on are listed in the project's GitHub issues. I'd recommend tackling each issue one at a time. You can comment on each issue that you're interested in working on.
On lines 140-141 of lib.rs, there is a comment where the author mentions having tried threading with rayon and finding it not much faster than Python threads.
Currently the Python version gets me the token length in ~0.26 seconds, while this crate takes ~1.8 seconds, so I propose we add threading back to speed up the process.
I am still fairly new to Rust, so this post is mainly to gather suggestions on how we would go about integrating threading.
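For discussion, here is a minimal sketch of one way rayon could be wired in, assuming the input can be split into chunks that the tokenizer treats independently. `encode_chunk` is a hypothetical stand-in for the crate's existing sequential encoder, and `rayon = "1"` would need to be added to Cargo.toml:

```rust
use rayon::prelude::*;

// Hypothetical stand-in for the crate's existing sequential BPE encoder.
fn encode_chunk(chunk: &str) -> Vec<u32> {
    chunk.bytes().map(u32::from).collect()
}

// Split the input into independent chunks (lines, for illustration),
// encode each chunk on rayon's thread pool, and concatenate the
// results in the original order (rayon's collect preserves order).
fn encode_parallel(text: &str) -> Vec<u32> {
    text.par_lines()
        .flat_map_iter(encode_chunk)
        .collect()
}

fn main() {
    let tokens = encode_parallel("hello world\nhello rayon");
    println!("{} tokens", tokens.len());
}
```

Two caveats: splitting on newlines can change the token stream at chunk boundaries if the BPE scheme has tokens that span newlines, so the chunking should follow the tokenizer's own pre-split rules; and since the lib.rs comment says rayon was already tried without much benefit, any change like this should be validated against a before/after benchmark.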