تلخيص تلقائي للنصوص العربية pdf
ملخص الدراسة:
Arabic language is one of the most famous languages in the world; its importance comes from being the fifth language that has native speakers in the world. Creating a good summary of the text is one of the most important things in the linguistics because it gives the user the most important paragraphs in the text that he wants to read. There are some techniques to summarize Arabic language, but they are still little and need to be improved. One approach that are used in text summarization is graph based but it still need enhacment. This thesis builds a new algorithm called GBATSS (Graph Based Arabic Text Summarizer) to summarize Arabic text depending on NLP and Google page rank algorithm. The system works on three basic units. These units are rooted stem, light stem, and finally no-stem. The system depends on compression ratio of 40 %. The process of summarization is done in 12 stages start from data collection, text preprocessing, text normalization, text tokenization, stemming, stop words removal, building graph, calculating edge weighting, applying page rank, and finally extracting the summary. Finally, we tested the system using EASC data set and using the recall, precision and f-measure for evaluation process. The results show that the using of rooted-stem as a basic unit gives the best results then no-stem and finally light-stem.
توثيق المرجعي (APA)
Elfarra, Eyad (2015). Automatic Arabic Text Summarization. الجامعة الإسلامية - غزة. 19881
خصائص الدراسة
-
المؤلف
Elfarra, Eyad
-
سنة النشر
2015
-
الناشر:
الجامعة الإسلامية - غزة
-
المصدر:
المستودع الرقمي للجامعة الإسلامية بغزة
-
نوع المحتوى:
رسالة ماجستير
-
اللغة:
English
-
محكمة:
نعم
-
الدولة:
فلسطين
-
النص:
دراسة كاملة
-
نوع الملف:
pdf