To extract the description of Asthma from the HTML code block retrieved in the previous exercise, use a GREL parsing expression.
Parsing HTML
- Click on the Wikipedia_Info column's drop-down arrow and select Edit column, then Add column based on this column

- Give this new column the title 'Wikipedia_Info_Parsed'
- In the Expression box, type the expression: value.parseHtml().select("page")[0].htmlText()

- The expression parses each cell as HTML, selects the first page element, and returns its text. As a result, your new Wikipedia_Info_Parsed column will contain only the description of Asthma, without the surrounding HTML code.
- Further GREL parsing expressions can be found through sources such as Library Carpentry and Stack Overflow.
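
To see what the expression in the steps above does outside OpenRefine, the same three steps (parse the markup, take the first page element, return its text) can be sketched in Python using only the standard library. This is an illustration, not how OpenRefine implements GREL: the helper names and the sample cell value are hypothetical, and the whitespace handling only approximates htmlText().

```python
from html.parser import HTMLParser


class FirstElementText(HTMLParser):
    """Collect the text inside the first element with a given tag name."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0      # greater than 0 while inside the target element
        self.done = False   # True once the first matching element has closed
        self.parts = []     # text fragments found inside the element

    def handle_starttag(self, tag, attrs):
        if not self.done and tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if not self.done and tag == self.tag and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth > 0 and not self.done:
            self.parts.append(data)


def first_element_text(markup, tag):
    """Rough analogue of value.parseHtml().select(tag)[0].htmlText()."""
    parser = FirstElementText(tag)
    parser.feed(markup)
    parser.close()
    # Collapse runs of whitespace (an approximation of htmlText())
    return " ".join("".join(parser.parts).split())


# Hypothetical cell value standing in for the Wikipedia_Info column
sample = (
    '<api><query><pages>'
    '<page title="Asthma"><extract>Asthma is a <b>long-term</b> '
    'inflammatory disease of the airways.</extract></page>'
    '</pages></query></api>'
)

description = first_element_text(sample, "page")
print(description)
```

All tags inside the page element are dropped and only the text is kept, which is why the parsed column shows a plain description. Unlike the GREL expression, which raises an error when no page element exists, this sketch returns an empty string in that case.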