<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>University of Tabriz</PublisherName>
				<JournalTitle>Tabriz Journal of Electrical Engineering</JournalTitle>
				<Issn>2008-7799</Issn>
				<Volume>47</Volume>
				<Issue>3</Issue>
				<PubDate PubStatus="epublish">
					<Year>2017</Year>
					<Month>11</Month>
					<Day>22</Day>
				</PubDate>
			</Journal>
<ArticleTitle>A Method for Automatic Key phrase Extraction from Persian Web News</ArticleTitle>
<VernacularTitle>A Method for Automatic Key phrase Extraction from Persian Web News</VernacularTitle>
			<FirstPage>857</FirstPage>
			<LastPage>866</LastPage>
			<ELocationID EIdType="pii">6219</ELocationID>
			
			
			<Language>FA</Language>
<AuthorList>
<Author>
					<FirstName>M.</FirstName>
					<LastName>Basereh</LastName>
<Affiliation>Faculty of Electrical and Computer Engineering, University of Yazd, Yazd, Iran</Affiliation>

</Author>
<Author>
					<FirstName>V.</FirstName>
					<LastName>Derhami</LastName>
<Affiliation>Faculty of Electrical and Computer Engineering, University of Yazd, Yazd, Iran</Affiliation>

</Author>
<Author>
					<FirstName>S.</FirstName>
					<LastName>Zarifzadeh</LastName>
<Affiliation>Faculty of Electrical and Computer Engineering, University of Yazd, Yazd, Iran</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2016</Year>
					<Month>06</Month>
					<Day>28</Day>
				</PubDate>
			</History>
		<Abstract>Text documents, especially news, are one of the important information retrieval fields which are necessary to extract information. This job, is done by extracting key phrases which include the main context of the news. In this research, a three level approach combining lingual, supervised learning, heuristic, and a relatively comprehensive number of statistical approaches, is suggested for key phrase extraction from Persian news web pages. A news dataset and a stop word list are generated. In this research, according to the data characteristics, Random Forest classifier is used; and its good performance is proved through experiments. Furthermore, using scores given by classifier to phrases, to build an ordered list of phrases, for classification, instead of using the classifier output, is suggested. Results show an acceptable f-measure.</Abstract>
			<OtherAbstract Language="FA">Text documents, especially news, are one of the important information retrieval fields which are necessary to extract information. This job, is done by extracting key phrases which include the main context of the news. In this research, a three level approach combining lingual, supervised learning, heuristic, and a relatively comprehensive number of statistical approaches, is suggested for key phrase extraction from Persian news web pages. A news dataset and a stop word list are generated. In this research, according to the data characteristics, Random Forest classifier is used; and its good performance is proved through experiments. Furthermore, using scores given by classifier to phrases, to build an ordered list of phrases, for classification, instead of using the classifier output, is suggested. Results show an acceptable f-measure.</OtherAbstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Keyphrase</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">keyphrase extraction</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">news texts</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">statistical techniques</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">supervised learning</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">text mining</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">information retrieval</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://tjee.tabrizu.ac.ir/article_6219_4f33a58972c07a79ba33e22812c65f5f.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
