2655 comments found.
Hello. I need to move the license to a new domain. I just changed the name of the site. How do I transfer the license or get a new one? thanks
Hi,
Please see this FAQ.
Will this scrape image inside directories? Can I set this to scrape for ALL images recursively inside a wordpress upload folder(s)?
Hi,
The plugin can only save the images inside the posts, and it cannot recursively find the post URLs, unfortunately.
Hi, is this plugin work with Classified Listing Pro plugin, i mean can i get other listing from other website or only work with post like a blog ? Thanks
Hi,
I am not familiar with the inner workings of the specified plugin, but WP Content Crawler can save the posts as custom post types. It is also possible to save plain text values for custom fields and taxonomies.
I didn’t find in docs how to update the plugin to new version. Do you have link to this?
Hi,
You can update the plugin in the same way you update any other plugin. You have two options:
- Deactivate the plugin, delete it, and install the new version. Your site settings will not be deleted. They are stored in the database. So, removing the plugin’s files from your site or deactivating the plugin does not delete the settings.
- On the sidebar of your admin panel, go to Dashboard > Updates page. If there is a new version, it will be displayed there. Then, you can update the plugin from that page. Please note that if you did not enter your purchase code, an update will not be displayed there, even if there is a new version.
I get an error when I use tool below: https://ibb.co/0VzvjBg. I try to access the link and it says to check the robot. Can I fix this error?
Hi,
This error does not seem to be related to the plugin. You can try to find a solution online by searching the error you received on the Internet. For example, there is this article.
Hi,
It looks like something installed on your WordPress site prevents the plugin from connecting to the license server, as one of your screenshots show that your server is capable of connecting to the license server. Maybe you can enable the debugging feature of WordPress, reproduce the issue, and then check the debug.log file to see if the error is written there.
<script type="text/javascript">var ......
window.dataLayer=window.dataLayer||[];dataLayer.push({'pageCategory':'1001009'});dataLayer.push({'pageType':'Article'});dataLayer.push({'pagePlatform':'Web'});dataLayer.push({'pageSubcategory':'Thường thức'});dataLayer.push({'articleId':'4510532'});dataLayer.push({'articleTitle':'Những nền văn minh lâu đời nhất hành tinh'});dataLayer.push({'articleAuthor':'1700000248'});dataLayer.push({'articleAuthorName':''});dataLayer.push({'articlePublishDate':'20220914200000'});
...
dataLayer.push({'articleTags':'nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại'});
...
dataLayer.push({'tag_id':'51467, 1527186, ...https://vnexpress.net/nhung-nen-van-minh-lau-doi-nhat-hanh-tinh-4510532.html"/>
I use Regex for get tags name (\barticleTags\’:\’([^;]+)\b) and get the value as nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại
Could you please show me how to import this value to tags?
Hi,
I think the easiest way to do this is to create another element that contains the tag as its text. Here is how you can do it:
-
Go to Post Tab > Manipulate HTML Section > Find and replace in element HTML setting and configure it like this:
Regex: Checked Selector: script Find: \barticleTags\':\'([^;]+)\b Replace: </script><span id="my-tag">$1</span><script>
This should convert the script element containing the tag to something like this:
<script type="text/javascript">var ...... window.dataLayer=window.dataLayer||[];dataLayer.push({'pageCategory':'1001009'});dataLayer.push({'pageType':'Article'});dataLayer.push({'pagePlatform':'Web'});dataLayer.push({'pageSubcategory':'Thường thức'});dataLayer.push({'articleId':'4510532'});dataLayer.push({'articleTitle':'Những nền văn minh lâu đời nhất hành tinh'});dataLayer.push({'articleAuthor':'1700000248'});dataLayer.push({'articleAuthorName':''});dataLayer.push({'articlePublishDate':'20220914200000'}); ... dataLayer.push({' </script> <span id="my-tag">nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại</span> <script>'}); ... dataLayer.push({'tag_id':'51467, 1527186, ...https://vnexpress.net/nhung-nen-van-minh-lau-doi-nhat-hanh-tinh-4510532.html"/> .... </script>In other words, by making this change, you add the following element:<span id="my-tag">nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại</span>
-
Then, go to Post Tab > Post Tag Selectors setting and configure it like this to select the content of the newly created element’s text as a post tag:
Selector: #my-tag Attribute: text
Please note that I could not check if this works as intended since you did not provide the URL of the page that contains the script element.
I did not realize that the URL in the script was for the post
So, I checked the page’s source code. Actually, the tags exist in other places as well, meaning that you can simply go to Post Tab > Post Tag Selectors setting and configure it like this:
Selector: meta[name="keywords"] Attribute: content
Like this: https://ibb.co/wsG9s3z
It looks like you missed my previous reply. You can use the meta element that contains the tags.
Thank you for your reply, btw, I set Exchange element attributes for img from data-src to src, but test not work https://ibb.co/tbXMPCj https://ibb.co/mCkKmHy
Tester: https://ibb.co/02TVfmC It just get old src?
There can be two causes of this. The first one is that you might not have saved the settings. The other one is that there might be other elements in the page that already contain the correct image URL in their “src” attributes. You can specifically target the elements whose src attribute contains a data URL via a CSS selector like this in the Exchange element attributes setting:
.fig-picture picture .lazy[src^="data:"]
The CSS selector above should select the elements whose src attribute’s value starts with “data:”.
It’s not work! I have already updated setting every times
I do not see any issues in the demo as it can be seen in this screenshot: https://ibb.co/pxwvQrC Could you please import your settings to the demo and check if the same issue happens there as well?
When I change post content selectors to ”.fck_detail” (first is ”.Normal”), it’s worked! Thanks for your support
All right. Thanks for letting me know.
Your welcome! and why it’s not save the post? Just queue the links? https://ibb.co/DD7cZGg
You can see this page to troubleshoot automatic crawling issues.
I did all suggest in this page, and I set my setting in demo page: penguinlvt (509812). Could you please advise me?
The posts are crawled. But, because you configured the post date selectors to find a date that is different from today, you do not see the crawled posts in the dashboard. You can track the number of saved posts by looking at the “all” column: https://ibb.co/WFLQB79
Oh yeah, I see! https://ibb.co/BKdt3B4 It crowled like crazy hihi
I deleted all post then I use “tool” retrieve but It’s error https://ibb.co/LR66rRW
Hi,
If you are checking the duplicates via the post title and/or content, there might be another post that matches the criteria. Maybe you can check the duplicates only via URL.
when I used “tester” it’s work ok, and have image in the content. But auto post is it’s not work, not image in content, link image change to error link https://ibb.co/DzfQbVd
Hi,
You need to put the image URL into the “src” attribute of the “img” element. You can see this page to learn how to do that.
I change domain, import setting but It’s not work in new domain https://ibb.co/PZgmxTW. Old domain still work
Hi,
The target website might be blocking your new server, since it sends “503 Service Unavailable” response. You might be able to access it by using a proxy or changing the request headers such as HTTP User Agent header. In any case, the issue is not related to the plugin, because the plugin successfully makes a request to the target site.
I use tab https://ibb.co/fvBfG4V. All content retrieved but not in tab.
Hi,
If the tab’s contents are created via JavaScript, the plugin cannot retrieve them. For more information, you can see this page.
I retrieved the content. With my web, I want to post the content in the tab but when I use “tester” it doesn’t cut the content into the tab. I tried use “tool” manual post, it doesn’t work either
The tabs are created via JavaScript. You should configure your theme to convert the contents into tabs. You cannot create the tabs via the plugin.
I have this code in html:
Tags: <a href="http://redsvn.net/tag/nhat-ban/" rel="tag">Nhật Bản</a>, <a href="http://redsvn.net/tag/the-chien-2/" rel="tag">Thế chiến II</a><br /><br />
<div id="fb-root" />
It showed in web like this: https://ibb.co/zsh4zGj And I get tags name by this setting: https://ibb.co/ysx2mfG Now, when I remove this code in raw HTML or in HTML at first load, it can not crawl tags name. https://ibb.co/4fjvfL2 How I can remove this code and keep tags name?
Thank you
Ps: I replace this code to <!-Tags…-> but it cannot get tags name too.
Hi,
You can use Templates Tab > Manipulate HTML Section > Find and replace in post’s content setting to make the replacement. The rules defined in this setting are executed after the tags are retrieved. You can see when exactly each setting is applied here.
I tried the demo, how do I search and replace in text-only paragraphs?
Hi,
You cannot directly find and replace in text-only paragraphs, unfortunately. However, you can find and replace in the element that contains the paragraph by using, for example, Post Tab > Manipulate HTML Section > Find and replace in element HTML setting.
Hello, I love the plugin, thanks!
I am having a couple of small issues though and wondering if you can help… I’m using the plugin on a child site to crawl a category on a parent site. I have successfully managed to get the crawl to work and post the posts in the category on the parent site to the child site. However:
1) The featured image quality is low 2) The Comments do not work on the new post (despite the tick box in General settings -> Posts being clicked)
Any ideas?
Thanks.
Hi,
Thanks, I am glad to hear that.
1) The plugin downloads the images as-is. It does not change the size or quality of the images. If the image URL found in the target site is for a high-quality version of the image, then maybe your theme displays a lower-quality version of the downloaded image, as WordPress might be creating different sizes for the image when it is saved. If that is the case, you probably need to configure your theme to use a high-quality version of the saved image. Otherwise, if the URL of the image is for a low-quality version, then you need to make the plugin find the URL of the high-quality version of the image, so that the plugin can download the high-quality image from the target site.
2) There might be a few reasons for this. I just crawled a post in the demo site of the plugin, and the post can be commented on. This indicates that the issue is either related to your configuration or your theme. If you are using custom general settings for the site, the checkbox of “Allow Comments” setting might not be checked there. If you are not using custom general settings, and the “Allow Comments” checkbox is checked in the general settings, then maybe your site or your theme is configured to not allow comments.
If you previously crawled a post when the “Allow Comments” checkbox was not checked, and you checked the checkbox after that, the previously crawled posts will not be automatically configured to allow comments. You need to recrawl them.
Thank you for your fast reply.
1) The source url is for a high quality image, however the image saved is 300px wide. So guessing it’s something in this step rather than theme template (using Elementor Pro theme builder for this). I have no image compression plugins running… any ideas?
2) “Allow comments” is checked. The post itself is allowed to be commented on. Theme is configured to allow comments. I guess it’s something to do with Elementor theme builder displaying – just very weird it works perfectly on any native posts…
Any thoughts?
Thanks again. M
Just as a further point… on 2) When I edit with Elementor on the Single Post template on one of the freshly crawled posts, it has an error that says: Comments are closed. Switch on comments form the discussion box or the post edit screen… both of these are on and were on when the post was imported.
And, the related posts doesn’t appear, which is normally inserted by Jetpack…
Thanks again for your help on this
As I said, the plugin does not change the size or quality of the downloaded images. It is not capable of doing that. There is no code in the plugin for resizing an image. Hence, it is impossible for the plugin to do that.
About the comments, I checked it in the demo site, and it works without any issues. Maybe it is related to another plugin or your theme. Have you tried disabling other plugins, converting your theme to one of the official WordPress themes, and then checking if the issue still happens?
Hello, just replying to this as I still haven’t found a resolution.
I’ve disabled all plugins, reverted to original wordpress theme, spoken to Elementor support, who say it must be your fault of course… I don’t know what to do…
The featured image that is scraped is low quality on the new site. This is the major issue.
The comments do not work on most of the scraped posts (it seems they do on some).
Everyone just keeps finger pointing and blaming another plugin. I’ve also had to buy Content Crawler for 2 sites, when all i need it to do is crape one post for each site approximately once per month. So I have 2 licenses, both not working properly.
Can you help me?
Could you please import your site settings to the demo site and try to reproduce the issues you are having? Do the same issues happen there as well?
hello, thanks for your speedy reply – appreciate the help.
i imported the scraper settings and on the post it scraped, no featured image at all?
comments are there
no pictures seem to be scraped inside the content either
Could you please tell me the name of the site settings you created in the demo so that I can check it?
MM Test
Selector: img[data-src] Attribute 1: src Attribute 2: data-src
You can also see this screenshot: https://ibb.co/g4QDsbd
Once I do that, the test results are shown like this: https://ibb.co/RcgpXRn As you can see, the image in the post content is shown. Also, the featured image is saved. Its dimension is 2560×1920.
You should always test your site settings by using the Tester page to catch this type of issues and fix them.
great thanks! and what about comments sometimes not working? on this post The Chief Explorer’s Almanac on the demo site they don’t work for example.
I can see that the comments are enabled for that post. Could you please further explain what you mean by that?
By the way, I saw that you also configured the plugin to save the featured images from the category pages. If the featured images are available in the post pages, the recommended way is to save them only from the post pages. If you still want to do that, you should also exchange the “src” and “data-src” attributes’ values in the category pages.
sorry, i’m still not familiar with all these terms. I would prefer to scrape the featured image from the post than the category – what vaules do I change where?
yes, you’re right comments are working. they worked on the last post scraped, so maybe this was settings I had wrong in the begin. i’ll keep an eye on the comments and reach out if I have any issues. really appreciate your help 
a great plugin!
You can simply disable the crawling of the featured images under the Category tab by unchecking Category Tab > Featured Images Section > Save featured images? setting’s checkbox.
amazing thank you – you’re a legend!
Thanks. I am glad it is resolved. By the way, since the site settings on your site was configured to crawl featured images from the category pages, you should delete the already-saved post URLs from the database by using the Tools Page > URLs Tab > Clear URLs tool. You should delete all URLs for the site. Please do not forget to check the “I’m sure” checkbox. The reason you should do this is to clear the featured image URLs stored in the database. Otherwise, the plugin will keep using those wrong featured image URLs. The plugin will save the post URLs from the category pages again, that time without storing the featured image URLs.
ok great, thanks. is there a way of asking the plugin to rescrape the existing posts to update the featured image to a new high resolution one?
I see that you already enabled the recrawling. When it is enabled, the plugin updates the posts. However, since you previously saved the featured image URLs from the category pages, it doesn’t update the featured images. So, once you disable crawling of the featured images from the category pages and delete the existing URLs from the database, the plugin will update the featured images by using the settings under the Post Tab.
amazing, so just sit back and wait. thanks again!! have a great day!
Thanks, you too.
If the recrawling interval is too long, you can also manually recrawl them by using Tools Page > Manual Recrawling Tab.
319 / 5 000 Hello turgutsaricam, I would like to use your great plugin, I tried it a while ago on a previous server, but I already deleted it. Now I would like to use it on a new domain, but it won’t let me, it says This license has reached its domain limit and is not valid for this domain. Registered domains: hosting136592.a2e49.netcup.net How can I move it again? Many thanks: Tamas
Hi,
Could you please send your purchase code via the contact form on my profile page so that I can remove the registered domain from your license?
Hello, why caught the posts in the queue and cannot be saved?
Hi,
You can see this page to troubleshoot the automatic crawling problems.
hi, are you sure that the new MS Translate API (portal.azure.com) working fine? The Google API works fine, but not Mircosoft:
Message: Exception Details: Response could not be retrieved from Microsoft Translator Text API with https://api.cognitive.microsofttranslator.com/translate?api-version=3.0&from=es&to=de&textType=html - file_get_contents(https://api.cognitive.microsofttranslator.com/translate?api-version=3.0&from=es&to=de&textType=html): failed to open stream: HTTP request failed! HTTP/1.1 401 Unauthorized (2) Type: error
Hi,
Because the plugin does not have an option that you can use to specify a region, when creating the API key on Microsoft Azure Portal, please make sure you create a global API key, not a region-specific one.
ah okay, i set it on global:
{code provided location ‘global’ is not available for resource group. List of available regions is ‘centralus,eastasia,southeastasia,eastus,eastus2,westus,westus2,northcentralus,southcentralus,westcentralus,northeurope,westeurope,japaneast,japanwest,brazilsouth,australiasoutheast,australiaeast,westindia,southindia,centralindia,canadacentral,canadaeast,uksouth,ukwest,koreacentral,koreasouth,francecentral,southafricanorth,uaenorth,australiacentral,switzerlandnorth,germanywestcentral,norwayeast,jioindiawest,westus3,qatarcentral,swedencentral,australiacentral2’.”}
I see the “global” region when creating a translator as it can be seen in this screenshot. If you have trouble finding it, you can contact the support of Microsoft Azure.
I want to transfer license of the domain luxurynuochoa.com to subpok.com.
My old domain has been lost and cannot be deactivated
Could you please send your purchase code via the contact form on my profile page so that I can remove the domain registered to your license?
I wanna crawl title in playlist https://www.youtube.com/playlist?list=PLDYlX8j8IhZJOyLfxg5BF1aVdFMAAujRy can I help me?
The plugin’s support does not cover configuration of the settings, unfortunately.
I see title in the JSON How to pick value?
Maybe you can use the “title” element instead of the JSON, since it is easier to retrieve the “title” element because the post title selectors setting does not have an options box that can be used to retrieve a JSON value.
I tried use a#video-title this isn’t work
That element does not exist in the page when JavaScript is disabled. Maybe you can use meta[property=”og:title”] selector and “content” as its attribute. For more information on what data you can retrieve from a page, you can see this page.
It still error, but I want to list title in the playlist. Not only meta title page. https://ibb.co/N2zB8nC
Could you build a short tutorial for a crawler for this case?
First of all, the double quote character used in your CSS selector looks like it is different from the regular double quotes. Also, fixing the double quotes does not result in the playlist title, as I saw when I tested it in the demo of the plugin. The reason is that YouTube does not include the playlist title when I make a request via the demo. It looks like the site sends different page contents in certain cases. Hence, the plugin cannot get the title that does not exist in the response sent from the target site.
I just got 1 value in script, can I get multiple values at the same time?
I do not understand what you mean, unfortunately.
I take a title in JSON. I want to take many title. Can you help me?https://ibb.co/xXsVPTt
That setting is for finding post URLs in category pages. I do not understand why you configured it to find a post title.
I just want demo, You can ignore it. You would like focus support for many title 
Could you please explain what exactly you want to do after finding multiple titles? Also, do you want to find multiple titles in a category page? If so, you cannot save titles from category pages.
I know in a section category need URL. I want to crawl multiple title for content.
I also want to URL for category. By the way, you can support for both cases
If you want to retrieve multiple values from a JSON object, you can create different custom short codes, as it is explained here and include all of them into the post content. For example, if you want to retrieve 10 values from a JSON object, you can define 10 different custom short codes where each finds one of the 10 values you want. Then, you can simply add them into the post content as it is explained in the documentation.
If you want to retrieve the URLs from JSON objects in category pages, you can use the options box of Category Tab > Category Post URL Selectors setting. You need to add a different rule for each of the values.
I want to replace but not work, can you support for me? https://ibb.co/L93c1zJ
For the find input’s value to be considered as a regular expression, you need to check the “regex” checkbox. For more information, you can see this page.
I have retrieve url pages: https://ibb.co/HrZpSXr. When I crawl, tool just crawl first page: https://ibb.co/m8Zpv0z. I can do multiple page?
The manual crawling tool does not automatically retrieve the next pages of the categories. You need to explicitly define the category URLs in the manual crawling tool.
How to retrieve all post of the categories if it have next page?
You can enable automatic crawling.
Can you give me link to the tutorial?
Sure. You can see the video tutorial or the written tutorial. The other guides are available here. The documentation of the plugin is available here. The other video tutorials can be seen here.
when I test and manual crawling post has retrieve iframe but when I run autocrawling post has no iframe. I don’t know if I missed something during setup? What makes manual and autocrawling different?
You can see this FAQ.
I click save but my actions are not saved. I lost all my settings like when I didn’t have license
The plugin is extensively tested via automated browser tests before every release, by using the plugin just like a user does, such as changing inputs, clicking buttons, and saving settings. These tests are run in multiple environments. Hence, I can say, in confidence, that the plugin does not simply lose all the settings. If your settings are partially lost, you can see this troubleshooting item to learn how to avoid it in the future. If your settings are completely lost, then the issue might be related to another plugin, your theme or your browser extensions. You might try to disable other plugins, convert your theme to one of the official WordPress themes, and disable all the browser extensions. Then, once you see that the plugin’s settings are saved without any issues, you can enable the previously disabled third-party software one-by-one and, after you enable each item, try to save the settings of the plugin again. Once you see that the settings are not saved as expected, you can say that the cause of the issue is the last item you enabled.
Error when I add many URL category https://ibb.co/vh1Jk7s . Maybe the reason is my server. Thanks
In HTML have many have class the same. I want to retrieves content in .... of the 2nd. How do i get the content?
You can use the nth-child selector or + operator such as ”.class + .class”. Because the CSS selectors are not a feature that is developed by the plugin, finding CSS selectors is not covered by the support, unfortunately. You can find many tutorials about CSS selectors on the web.
the nth-child selector or + operator => You can give me link tutorial?
I do not know a specific tutorial, unfortunately. You can find many tutorials on YouTube or just by searching on the web via Google or another search engine.
I want add id=”preload” for this script https://ibb.co/ySWGxmh I tried setting as below: https://ibb.co/xftXZXh But not work
Have you tried making that replacement in Manipulate HTML Section > Find and replace in raw HTML setting?
I tried https://ibb.co/PrBjYRM but not work
There is a new line after the opening tag of the script element. Your find-replace rule does not account for that. That’s why your replacement does not work. Maybe you can make the replacement by using regular expressions like this:
Regex: Checked Find: (<script)(>[\s\S]+window.__PRELOADED_STATE__) Replace: $1 id="my-id" $2
It works for the case where only a single script element exists, which can be seen here. The regex might not work in your case, though. If it does not work, you can change the regex accordingly. If you need to know more about what regular expressions are, you can see the documentation of find-replace setting.
Can you help me change domain to active thanks so much!
Hi,
Please see this FAQ.
please can you give me your skype or telegram
Hi,
Support is provided only from this comment section and via the contact form available on my profile page.