atributo de string en BeautifulSoup – Python

El atributo de string lo proporciona Beautiful Soup, que es un marco de web scraping para Python. El raspado web es el proceso de extracción de datos del sitio web utilizando herramientas automatizadas para acelerar el proceso. Si una etiqueta tiene solo un elemento secundario, y ese elemento secundario es NavigableString , se puede acceder al elemento secundario mediante .string .
Prerrequisito: Hermosa Instalación de Sopa .
Sintaxis:

 tag.string

Los siguientes ejemplos explican el concepto de string en Beautiful Soup.
Ejemplo 1: En este ejemplo, vamos a obtener la string.

Python3

# Import Beautiful Soup
from bs4 import BeautifulSoup
  
# Create the document
doc = "<body><b> Hello world </b><body>"
  
# Initialize the object with the document
soup = BeautifulSoup(doc, "html.parser")
  
# Get the b tag
tag = soup.b
  
# Print the string
print(tag.string)

Producción:

Hello World

Ejemplo 2: En este ejemplo, vamos a obtener el tipo de string.

Python3

# Import Beautiful Soup
from bs4 import BeautifulSoup
  
# Create the document
doc = "<body><b> Hello world </b><body>"
  
# Initialize the object with the document
soup = BeautifulSoup(doc, "html.parser")
  
# Get the b tag
tag = soup.b
  
# Print the type of string
print(type(tag.string))

Producción:

<class 'bs4.element.NavigableString'>

Publicación traducida automáticamente

Artículo escrito por gurrrung y traducido por Barcelona Geeks. The original can be accessed here. Licence: CCBY-SA

Python3

Python3

Deja una respuesta Cancelar la respuesta