Uses of Class
com.bytedesk.kbase.llm_website.crawl.WebsiteCrawlConfig
Packages that use WebsiteCrawlConfig
Package
Description
-
Uses of WebsiteCrawlConfig in com.bytedesk.kbase.llm_website
Fields in com.bytedesk.kbase.llm_website declared as WebsiteCrawlConfigModifier and TypeFieldDescriptionprivate @NotNull(message="\u6293\u53d6\u914d\u7f6e\u4e0d\u80fd\u4e3a\u7a7a") @Valid WebsiteCrawlConfigWebsiteCrawlConfigRequest.configprivate WebsiteCrawlConfigWebsiteCrawlRequest.configMethods in com.bytedesk.kbase.llm_website that return WebsiteCrawlConfigMethods in com.bytedesk.kbase.llm_website with parameters of type WebsiteCrawlConfigModifier and TypeMethodDescriptionWebsiteRestService.startCrawl(String websiteUid, WebsiteCrawlConfig config) 开始整站抓取WebsiteRestService.updateCrawlConfig(String websiteUid, WebsiteCrawlConfig config) 更新网站抓取配置 -
Uses of WebsiteCrawlConfig in com.bytedesk.kbase.llm_website.crawl
Methods in com.bytedesk.kbase.llm_website.crawl that return WebsiteCrawlConfigModifier and TypeMethodDescriptionWebsiteCrawlTask.getConfig()获取抓取配置static WebsiteCrawlConfigWebsiteCrawlConfig.getDeep()获取深度配置(更大深度和页面数)static WebsiteCrawlConfigWebsiteCrawlConfig.getDefault()获取默认配置static WebsiteCrawlConfigWebsiteCrawlConfig.getFast()获取快速配置(较少深度和页面数)Methods in com.bytedesk.kbase.llm_website.crawl with parameters of type WebsiteCrawlConfigModifier and TypeMethodDescriptionvoidWebsiteCrawlTask.setConfig(WebsiteCrawlConfig config) 设置抓取配置 -
Uses of WebsiteCrawlConfig in com.bytedesk.kbase.llm_website.service
Methods in com.bytedesk.kbase.llm_website.service with parameters of type WebsiteCrawlConfigModifier and TypeMethodDescriptionprivate booleanWebsiteCrawlerService.crawlSinglePage(String url, WebsiteEntity website, Set<String> visitedUrls, Set<String> urlsToVisit, WebsiteCrawlConfig config) 抓取单个页面private WebsiteCrawlTaskWebsiteCrawlerService.createCrawlTask(WebsiteEntity website, WebsiteCrawlConfig config) 创建抓取任务private voidWebsiteCrawlerService.extractLinks(org.jsoup.nodes.Document doc, String currentUrl, String baseUrl, Set<String> urlsToVisit, WebsiteCrawlConfig config) 提取页面链接private booleanWebsiteCrawlerService.isValidContent(String content, WebsiteCrawlConfig config) 验证内容是否有效private WebsiteCrawlResultWebsiteCrawlerService.performCrawl(WebsiteEntity website, WebsiteCrawlTask task, WebsiteCrawlConfig config) 执行抓取任务private booleanWebsiteCrawlerService.shouldCrawlUrl(String url, URL baseUrl, WebsiteCrawlConfig config) 判断URL是否应该被抓取WebsiteCrawlerService.startCrawl(String websiteUid, WebsiteCrawlConfig config) 开始整站抓取