File size: 1,912 Bytes
15ad492
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4c33bd0
 
15ad492
 
 
 
 
 
 
 
 
 
 
4c33bd0
 
15ad492
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
from langchain.prompts import (
    SystemMessagePromptTemplate,
    HumanMessagePromptTemplate,
    ChatPromptTemplate,
    PromptTemplate,
)

# Prompt templates
system_template_script = PromptTemplate(
    input_variables=["output_format", "html_content"],
    template="""You are a helpful assitant that helps people create python scripts for web scraping.
    --------------------------------
    The example of the html content is: {html_content}
    --------------------------------
    You have to create a python function that extract information from an html code using web scrapping.
    
    Try to select the deeper class that is common among the elements to make de find_all function.

    Your answer SHOULD only contain the python function code without any aditional word or character.

    Import the used libraries above the function definition.

    The function name must be extract_info.

    The function have to receive the html data as a parameter.

    Your function needs to extract information for all the elements with similar attributes.

    An element could have missing attributes

    Before calling .text or ['href'] methods, check if the element exists.

    ----------------
    FINAL ANSWER EXAMPLE:
    from bs4 import BeautifulSoup

    def extract_info(html):
        ...CODE...
        return {output_format}
    ----------------
    
    Always check if the element exists before calling some method.

    """,
)

human_template_script = PromptTemplate(input_variables=[], template="give me the code")

# Chat Prompt objects
system_template_script_prompt = SystemMessagePromptTemplate.from_template(
    system_template_script.template
)
human_template_script_prompt = HumanMessagePromptTemplate.from_template(
    human_template_script.template
)
chat_script_prompt = ChatPromptTemplate.from_messages(
    [system_template_script_prompt, human_template_script_prompt]
)