Add the support for GPT-4o model #398
base: main
Conversation
llm_toolkit/models.py
Outdated
```diff
@@ -198,7 +198,10 @@ def estimate_token_num(self, text) -> int:
   """Estimates the number of tokens in |text|."""
   # https://cookbook.openai.com/examples/how_to_count_tokens_with_tiktoken
   try:
-    encoder = tiktoken.encoding_for_model(self.name)
+    if 'gpt-4' in self.name:  # gpt-4 and gpt-4o
+      encoder = tiktoken.encoding_for_model('gpt-4')
```
What is the reason for not specifying `gpt-4o` directly? https://github.com/openai/tiktoken/blob/c0ba74c238d18b4824c25f3c27fc8698055b9a76/tiktoken/model.py#L22-L23
Hi, @DavidKorczynski. The link you provided claims support for `gpt-4o`, but running `encoder = tiktoken.encoding_for_model(self.name)` for `gpt-4o` directly now raises the error below:

```
Python 3.11.7 (main, Dec  8 2023, 18:56:57) [GCC 9.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import tiktoken
>>> encoder = tiktoken.encoding_for_model("gpt-4o")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/kaixuan/FDG_LLM/oss-fuzz-gen/.venv/lib/python3.11/site-packages/tiktoken/model.py", line 97, in encoding_for_model
    return get_encoding(encoding_name_for_model(model_name))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/kaixuan/FDG_LLM/oss-fuzz-gen/.venv/lib/python3.11/site-packages/tiktoken/model.py", line 84, in encoding_name_for_model
    raise KeyError(
KeyError: 'Could not automatically map gpt-4o to a tokeniser. Please use `tiktoken.get_encoding` to explicitly get the tokeniser you expect.'
```

You can confirm this again in your environment.
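For anyone hitting the same `KeyError`: tiktoken resolves a model name through a pair of dicts (exact model names plus name prefixes), and tiktoken 0.5.1 simply has no entry matching `gpt-4o`. Below is a minimal, self-contained sketch of that resolution logic; it is an illustrative stand-in, not tiktoken's actual code, and the table contents are abbreviated:

```python
# Abbreviated stand-in for tiktoken 0.5.1's model-name resolution:
# exact matches are tried first, then name prefixes. There is no entry
# matching 'gpt-4o' ('gpt-4o' does not start with 'gpt-4-'), hence the
# KeyError in the traceback above.
MODEL_TO_ENCODING = {"gpt-4": "cl100k_base"}          # exact-name table
MODEL_PREFIX_TO_ENCODING = {"gpt-4-": "cl100k_base"}  # prefix table

def encoding_name_for_model(model_name: str) -> str:
    """Resolve a model name to an encoding name, tiktoken-style."""
    if model_name in MODEL_TO_ENCODING:
        return MODEL_TO_ENCODING[model_name]
    for prefix, encoding in MODEL_PREFIX_TO_ENCODING.items():
        if model_name.startswith(prefix):
            return encoding
    raise KeyError(
        f"Could not automatically map {model_name} to a tokeniser.")

print(encoding_name_for_model("gpt-4"))       # cl100k_base (exact match)
print(encoding_name_for_model("gpt-4-0613"))  # cl100k_base (prefix match)
# encoding_name_for_model("gpt-4o") raises KeyError, as in the traceback
```

This is also why the PR's original workaround (`if 'gpt-4' in self.name`) sidesteps the problem: it never asks the table about `gpt-4o` at all.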
Likely because of `tiktoken==0.5.1` (line 14 in 3d20914).
Ok~ I tried tiktoken 0.7.0, and running `encoder = tiktoken.encoding_for_model("gpt-4o")` directly works now after upgrading tiktoken.
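Why the upgrade fixes it: tiktoken 0.7.0 added a `gpt-4o` entry to its model-to-encoding table, mapping it to the new `o200k_base` encoding. A tiny self-contained sketch (again an illustrative stand-in for tiktoken's lookup, not its real code, with the table abbreviated):

```python
# Abbreviated stand-in for tiktoken 0.7.0's model-to-encoding table.
MODEL_TO_ENCODING = {
    "gpt-4o": "o200k_base",  # entry added in tiktoken 0.7.0
    "gpt-4": "cl100k_base",
}

def encoding_name_for_model(model_name: str) -> str:
    """Resolve a model name to its encoding name, tiktoken-style."""
    try:
        return MODEL_TO_ENCODING[model_name]
    except KeyError:
        raise KeyError(
            f"Could not automatically map {model_name} to a tokeniser.")

print(encoding_name_for_model("gpt-4o"))  # o200k_base
print(encoding_name_for_model("gpt-4"))   # cl100k_base
```

With the upgraded dependency, `tiktoken.encoding_for_model(self.name)` resolves `gpt-4o` directly, so no special-casing is needed in the caller.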
@DavidKorczynski Do I need to recreate a PR to directly use `tiktoken` to get the encoding for `gpt-4o`, or can we continue using this PR (if you can revise the code)?
I think what we need is to update `tiktoken` to a newer version that includes `gpt-4o` and remove the changes in `estimate_token_num`, as we no longer need them -- you can do that in this PR, and that should make a complete contribution in terms of `gpt-4o` support.
Got it. I have finished, and I also upgraded all the required dependencies by following https://github.com/google/oss-fuzz-gen/blob/main/USAGE.md#updating-dependencies.
Thanks @MarkLee131 -- if #433 is happy let's land this
```diff
@@ -1,5 +1,5 @@
 #
-# This file is autogenerated by pip-compile with Python 3.12
+# This file is autogenerated by pip-compile with Python 3.11
```
Are you sure changing the version doesn't cause regressions?
In addition, do we really need to update all the libs for a tiktoken update?
Hi, I've actually been running oss-fuzz-gen with Python 3.11, and it should be fine.
Meanwhile, since some packages may depend on tiktoken, I updated all the dependencies directly to be safe. :)