pandas
numpy
tabula-py
openai
gradio
pyPDF2
marker-pdf
groq
bs4
nltk
tiktoken
pdf2image
google-generativeai