Search news
thml.search.news
Classes:
- GooNews – A class that allows you to search for news articles using Google News.
Functions:
- news_by_topic – Generate a news video by topic.
- news_generator – Generate news videos for a list of topics.
GooNews(language='en', country='US', max_results=100, period=None, start_date=None, end_date=None, exclude_websites=None, proxy=None)
Bases: GNews
A class that allows you to search for news articles using Google News.
Parameters:
- language (str, default: 'en') – The language in which to return results.
- country (str, default: 'US') – The country code of the country you want to get headlines for.
- max_results (int, default: 100) – The maximum number of results to return.
- period (str, default: None) – The period of time from which you want the news.
- start_date (str, default: None) – Date after which results must have been published.
- end_date (str, default: None) – Date before which results must have been published.
- exclude_websites (list, default: None) – A list of strings that indicate websites to exclude from results.
- proxy (dict, default: None) – The proxy parameter is a dictionary with a single key-value pair. The key is the
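As a rough illustration of how these parameters fit together, the sketch below collects them into a keyword dictionary and passes them to `GooNews` only when the package can be imported. The helpers `build_goonews_kwargs` and `make_goonews` and the sample values are assumptions made for this example and are not part of `thml`; the `'7d'` period string follows the convention of the parent GNews class.

```python
from typing import Optional


def build_goonews_kwargs(
    language: str = "en",
    country: str = "US",
    max_results: int = 100,
    period: Optional[str] = None,
    exclude_websites: Optional[list] = None,
) -> dict:
    """Hypothetical helper: gather constructor arguments in one place."""
    if max_results < 1:
        raise ValueError("max_results must be a positive integer")
    return {
        "language": language,
        "country": country,
        "max_results": max_results,
        "period": period,
        "exclude_websites": exclude_websites,
    }


def make_goonews(kwargs: dict):
    """Return a GooNews instance, or None when thml is not installed."""
    try:
        from thml.search.news import GooNews
    except ImportError:
        return None
    return GooNews(**kwargs)


# Sample values are illustrative only.
kwargs = build_goonews_kwargs(
    max_results=10, period="7d", exclude_websites=["example.com"]
)
client = make_goonews(kwargs)
```

Keeping the arguments in a plain dictionary makes it easy to log or reuse the same search configuration across several topics.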
Methods:
- get_article – Download an article from the specified URL, parse it, and return an article object.
- download_article_material – Download the article's text, images, and videos to the specified directory.
get_article(url)
Download an article from the specified URL, parse it, and return an article object.
Parameters:
- url (str) – The URL of the article you wish to download and parse.
Returns:
- The article as defined by the newspaper4k package; see https://newspaper4k.readthedocs.io/en/latest/user_guide/api_reference.html.
- This is different from the previous implementation, which used newspaper3k.
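The returned object can be inspected like any newspaper-style article. The sketch below is a minimal example, assuming the object exposes the usual `title`, `authors`, `publish_date`, `text`, and `top_image` attributes; the helper `article_summary` is hypothetical, and a stand-in object is used so the snippet runs without network access.

```python
from types import SimpleNamespace


def article_summary(article) -> dict:
    """Collect a few commonly used fields from an article-like object."""
    text = getattr(article, "text", "") or ""
    return {
        "title": getattr(article, "title", None),
        "authors": list(getattr(article, "authors", []) or []),
        "publish_date": getattr(article, "publish_date", None),
        "n_words": len(text.split()),
        "top_image": getattr(article, "top_image", None),
    }


# Stand-in for the object returned by get_article(url).
fake_article = SimpleNamespace(
    title="Example headline",
    authors=["A. Writer"],
    publish_date=None,
    text="Three short words",
    top_image="https://example.com/image.jpg",
)
summary = article_summary(fake_article)
```

Using `getattr` with defaults keeps the helper tolerant of articles where some fields could not be extracted.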
download_article_material(url, output_dir='./article_material')
Download the article's text, images, and videos to the specified directory.
Parameters:
- url (str) – The URL of the article you wish to download.
- output_dir (str, default: './article_material') – The directory to save the article's material to.
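To give a feel for what saving article material involves, here is a minimal sketch of the text-only part. It is not the library's implementation: the helper `save_article_text` and the file name `article.txt` are assumptions for illustration, and the real method also handles images and videos.

```python
import tempfile
from pathlib import Path
from types import SimpleNamespace


def save_article_text(article, output_dir: str = "./article_material") -> Path:
    """Write an article's title and body to a text file in output_dir."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)  # create the directory if missing
    title = getattr(article, "title", "") or ""
    text = getattr(article, "text", "") or ""
    text_path = out / "article.txt"  # hypothetical file name
    text_path.write_text(f"{title}\n\n{text}", encoding="utf-8")
    return text_path


# Demonstrate with a stand-in article and a temporary directory.
demo_article = SimpleNamespace(title="Example headline", text="Body of the article.")
with tempfile.TemporaryDirectory() as tmp:
    saved_path = save_article_text(demo_article, output_dir=str(Path(tmp) / "material"))
    saved_content = saved_path.read_text(encoding="utf-8")
```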
news_by_topic(topic: str)
Generate a news video by topic.
Tasks:
1. Get Google News articles by topic.
2. Download article material (text, images, videos). Only text may be available.
3. Search for related videos (YouTube, Google, ...) and download them.
4. Generate the video.
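The four tasks above can be sketched as a small pipeline. This is only an outline under stated assumptions: `run_topic_pipeline` and the stub functions are hypothetical names, each step is passed in as a callable, and nothing here reflects how `news_by_topic` is actually implemented.

```python
from typing import Callable, List


def run_topic_pipeline(
    topic: str,
    search: Callable[[str], List[str]],
    download: Callable[[str], dict],
    find_videos: Callable[[str], List[str]],
    render: Callable[[str, List[dict], List[str]], str],
) -> str:
    """Run the four steps in order and return the rendered video name."""
    urls = search(topic)                          # 1. find articles
    materials = [download(url) for url in urls]   # 2. download material
    clips = find_videos(topic)                    # 3. find related videos
    return render(topic, materials, clips)        # 4. generate the video


# Stub steps so the sketch runs offline.
def fake_search(topic: str) -> List[str]:
    return [f"https://example.com/{topic}/1", f"https://example.com/{topic}/2"]


def fake_download(url: str) -> dict:
    return {"url": url, "text": "placeholder text"}


def fake_find_videos(topic: str) -> List[str]:
    return ["clip-a"]


def fake_render(topic: str, materials: List[dict], clips: List[str]) -> str:
    return f"{topic}.mp4 ({len(materials)} articles, {len(clips)} clips)"


result = run_topic_pipeline(
    "science", fake_search, fake_download, fake_find_videos, fake_render
)
```

Passing the steps in as callables keeps each stage replaceable, which helps when only text is available and the video search has to be skipped or stubbed.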
news_generator()
Generate news videos for a list of topics.
Tasks: Run news_by_topic for each topic in a given list.
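A loop of this kind might look like the sketch below. The helper `generate_for_topics` and the stub `fake_news_by_topic` are hypothetical; the error handling is one possible design choice, so that a failure on one topic does not stop the remaining topics.

```python
from typing import Callable, Dict, Iterable, Union


def generate_for_topics(
    topics: Iterable[str],
    make_video: Callable[[str], str],
) -> Dict[str, Union[str, Exception]]:
    """Call make_video for each topic, recording a result or the error."""
    results: Dict[str, Union[str, Exception]] = {}
    for topic in topics:
        try:
            results[topic] = make_video(topic)
        except Exception as exc:  # keep going if one topic fails
            results[topic] = exc
    return results


# Stub standing in for news_by_topic so the sketch runs offline.
def fake_news_by_topic(topic: str) -> str:
    if topic == "broken":
        raise ValueError("no articles found")
    return f"{topic}.mp4"


results = generate_for_topics(["sports", "broken", "tech"], fake_news_by_topic)
```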