-
Notifications
You must be signed in to change notification settings - Fork 361
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to train the Model on new dataset changing the vocabulary size #52
Comments
Hi, If the system is only printing "Generating first batch", it means that the batch generation failed which resulted in skipping of the epoch. So "Generating first batch" would have been printed the number of epoch times. Hope this helps. |
Thanks for the reply, actually the previous error has been resolved. Now the Error is in Dealing with Upper Case Letters. If you look at the following line of Code for c in lex: |
Actually i am trying to make the code working for both Upper and lowercase |
The issue is in the above Code actually it should be written in this manner for c in lex: |
Still not Getting Satisfactory Result. How can i train the Model to predict both Uppercase and Lowercase characters. |
@HidayatRahman how did you fix the image size issue? any progress regarding upper case training? Thanks in advance |
There was no issued in the Image size actually the issue was in my Conversion for uppercase letter. Anyhow i managed it and the code works for both upper and lower case, you have to add the code in data_gen.py where the ascii conversion actually occurs. And also you have to also do the decoding in model.py function manually. hope it helps, |
Hi @HidayatRahman , Could you please explain how to edit the model.py? I tried to do it using your hints but it doesn't work. Maybe if you share how exactly you modified the model.py, I should understand where I have the mistakes. Thank you! |
Sorry Man i just lost My Virtual Machine in which i was running the Code. but you have to do the reverse step in Model.py in order to decode it Properly. line number 427 and 428 does the same Here you have to just add another condition 64<ascii<91. I guess its pretty simple now. Let me know if its still not working. best wishes, |
Hi @HidayatRahman , Yes, I tried to do just that but I don't quite understand those 1 liner. I tried to expand the code to a one that I understand but I get an error. |
hi @HidayatRahman What is the vocabulary size used for training ? |
hi @shoaibahmed |
Hi,
I am training the Model on my own dataset which contains both uppercase and lowercase letter although it doesn't contains any wildcards so the new vocabulary is 26+26+10+3=65. The problem is the code only outputs Generating first batch instead of logging the loss and perplexity.
Any help would be appreciated.
thanks.
The text was updated successfully, but these errors were encountered: