Jsoup get metadata from HTML

Let us discuss how to get metadata from HTML using Jsoup API with the help of below example.

Follow the below steps:

1. Use connect(String url) method of Jsoup class which returns the connection of specified URL. 2. Use get() method of Connection class which returns Document object. 3. Get metadata (description and keywords) from document object. 4. Print metadata.

Example:

JsoupGetMetaData.java

import java.io.IOException;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
 
/**
 * This class is used get meta data from HTML using Jsoup.
 * @author w3spoint
 */
public class JsoupGetMetaData {
  public static void main(String args[]){
    Document document;
    try {
    	//Get Document object after parsing the html from given url.
	document = Jsoup.connect(
          "https://www.w3schools.blog/jsoup-get-images-from-html-example/")
           .get();
 
	//Get description from document object.
	String description = 
              document.select("meta[name=description]").get(0)
              .attr("content");
	//Print description.
	System.out.println("Meta Description: " + description);
 
	//Get keywords from document object.
	String keywords = 
                document.select("meta[name=keywords]").first()
                .attr("content");
	//Print keywords.
	System.out.println("Meta Keyword : " + keywords);
 
    } catch (IOException e) {
	e.printStackTrace();
    }		
  }
}

Output:

Meta Description: 
Jsoup get images from HTML example. Jsoup stands for Java HTML parser. 
It is an open source java library which provides API for extracting 
and manipulating data
Meta Keyword : 
jsoup, tutorial, beginners, professionals, images, 
introduction, example, java, html, parser

Download this example.   Next Topic: Jsoup get form parameters example. Previous Topic: Jsoup get images from HTML example.

 

Please follow and like us:
Content Protection by DMCA.com