Show simple item record

dc.contributor.authorWeerasinghe, M
dc.contributor.authorMaduranga, MWP
dc.contributor.authorKawya, MVT
dc.date.accessioned2024-03-26T05:32:04Z
dc.date.available2024-03-26T05:32:04Z
dc.date.issued2023-01
dc.identifier.urihttp://ir.kdu.ac.lk/handle/345/7530
dc.description.abstractWeb scraping, the process of extracting data from websites, plays a crucial role in data collection for research, analysis, and automation. However, traditional web scraping techniques face challenges such as handling dynamic websites, anti-scraping measures, and extracting structured data from unstructured web pages. In recent years, artificial intelligence (AI) has emerged as a powerful tool to enhance web scraping, offering solutions to overcome these challenges and improve data extraction efficiency and effectiveness. This review explores the application of AI techniques in web scraping, including natural language processing for information extraction, machine learning for web page classification and computer vision for web page parsing. The benefits of AI-enhanced web scraping include improved accuracy, enhanced efficiency, handling dynamic websites, and scalability. Further, there are multiple challenges with the use of AI in web scraping. Ensuring the ethical and responsible use of AI in scraping is crucial to respect privacy rights, intellectual property, and terms of service of websites. However, the ethical considerations and the need to adapt to evolving anti-scraping measures pose challenges. This review highlights the potential of AI in web scraping and emphasizes the importance of responsible and ethical practices.en_US
dc.language.isoenen_US
dc.subjectWeb scraping,en_US
dc.subjectArtificial intelligence,en_US
dc.subjectMachine learning,en_US
dc.subjectNatural language processing,en_US
dc.subjectData extractionen_US
dc.titleEnhancing Web Scraping with Artificial Intelligence: A Reviewen_US
dc.typeArticle Abstracten_US
dc.identifier.facultyFaculty of Computingen_US
dc.identifier.journalKDU SSFOCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record