Discussion on WP Content Crawler - Get content from almost any site, automatically!

3,660 sales

turgutsaricam supports this item

This author's response time can be up to 5 business days.

2655 comments found.

Hello. I need to move the license to a new domain. I just changed the name of the site. How do I transfer the license or get a new one? thanks

Hi,

Please see this FAQ.

Will this scrape images inside directories? Can I set it to scrape ALL images recursively inside the WordPress upload folder(s)?

Hi,

The plugin can only save the images inside the posts, and it cannot recursively find the post URLs, unfortunately.

Hi, does this plugin work with the Classified Listing Pro plugin? I mean, can I get listings from other websites, or does it only work with posts, like a blog? Thanks

Hi,

I am not familiar with the inner workings of the specified plugin, but WP Content Crawler can save the posts as custom post types. It is also possible to save plain text values for custom fields and taxonomies.

I didn’t find in the docs how to update the plugin to a new version. Do you have a link to this?

Hi,

You can update the plugin in the same way you update any other plugin. You have two options:

  1. Deactivate the plugin, delete it, and install the new version. Your site settings will not be deleted. They are stored in the database. So, removing the plugin’s files from your site or deactivating the plugin does not delete the settings.
  2. On the sidebar of your admin panel, go to Dashboard > Updates page. If there is a new version, it will be displayed there. Then, you can update the plugin from that page. Please note that if you did not enter your purchase code, an update will not be displayed there, even if there is a new version.

I get an error when I use the tool below: https://ibb.co/0VzvjBg. When I try to access the link, it asks me to complete a robot check. Can I fix this error?

Hi,

This error does not seem to be related to the plugin. You can try to find a solution online by searching the error you received on the Internet. For example, there is this article.

Hi,

It looks like something installed on your WordPress site prevents the plugin from connecting to the license server, as one of your screenshots shows that your server is capable of connecting to the license server. Maybe you can enable the debugging feature of WordPress, reproduce the issue, and then check the debug.log file to see if the error is written there.

I crawl tags from a website, but the site sets the tags in JavaScript code like this:
<script type="text/javascript">var ......
window.dataLayer=window.dataLayer||[];dataLayer.push({'pageCategory':'1001009'});dataLayer.push({'pageType':'Article'});dataLayer.push({'pagePlatform':'Web'});dataLayer.push({'pageSubcategory':'Thường thức'});dataLayer.push({'articleId':'4510532'});dataLayer.push({'articleTitle':'Những nền văn minh lâu đời nhất hành tinh'});dataLayer.push({'articleAuthor':'1700000248'});dataLayer.push({'articleAuthorName':''});dataLayer.push({'articlePublishDate':'20220914200000'});

...
dataLayer.push({'articleTags':'nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại'});
...

dataLayer.push({'tag_id':'51467, 1527186, ...https://vnexpress.net/nhung-nen-van-minh-lau-doi-nhat-hanh-tinh-4510532.html"/>
I use the regex (\barticleTags\':\'([^;]+)\b) to get the tag names, and it returns this value: nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại

Could you please show me how to import this value to tags?
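For reference, the extraction step on its own can be sketched in Python as follows. This is only an illustration of the regex logic, not how the plugin works internally. The function name `extract_article_tags` is hypothetical, and the sample string is shortened from the snippet above.

```python
import re

# Matches an entry like {'articleTags':'...'} and captures the quoted value.
ARTICLE_TAGS = re.compile(r"'articleTags'\s*:\s*'([^']*)'")


def extract_article_tags(html):
    """Return the tag names found in the articleTags entry, or an empty list."""
    match = ARTICLE_TAGS.search(html)
    if match is None:
        return []
    # The value is one comma-separated string, so split it into separate names.
    return [name.strip() for name in match.group(1).split(",") if name.strip()]


sample = "dataLayer.push({'articleTags':'nền văn minh, nền văn minh lâu đời nhất, Ai Cập cổ đại'});"
print(extract_article_tags(sample))
```

Capturing up to the closing quote, rather than up to the semicolon, keeps the trailing `'})` out of the value, and splitting on commas gives one entry per tag.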

I did everything suggested on this page, and I saved my settings on the demo page: penguinlvt (509812). Could you please advise me?

The posts are crawled. But, because you configured the post date selectors to find a date that is different from today, you do not see the crawled posts in the dashboard. You can track the number of saved posts by looking at the “all” column: https://ibb.co/WFLQB79

Oh yeah, I see! https://ibb.co/BKdt3B4 It crawled like crazy, hihi

I deleted all the posts and then used the “tool” to retrieve them again, but it gives an error: https://ibb.co/LR66rRW

Hi,

If you are checking the duplicates via the post title and/or content, there might be another post that matches the criteria. Maybe you can check the duplicates only via URL.

When I use the “tester”, it works fine and the content has images. But automatic posting does not work: there are no images in the content, and the image links turn into broken links: https://ibb.co/DzfQbVd

Hi,

You need to put the image URL into the “src” attribute of the “img” element. You can see this page to learn how to do that.
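As a rough illustration of what such a rewrite does, here is a minimal Python sketch. It assumes the real image URL sits in a `data-src` attribute, which is only a guess about the target site (lazy-loading scripts often use it). The function name `promote_data_src` is hypothetical, and this is not the plugin's own code.

```python
import re

IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)
# Assumption: the lazy-loaded URL is stored in data-src.
DATA_SRC = re.compile(r'\bdata-src="([^"]*)"', re.IGNORECASE)
# A plain src attribute, excluding attributes such as data-src.
SRC = re.compile(r'(?<![\w-])src="[^"]*"', re.IGNORECASE)


def promote_data_src(html):
    """Copy each img element's data-src value into its src attribute."""
    def fix(match):
        tag = match.group(0)
        found = DATA_SRC.search(tag)
        if not found:
            return tag  # nothing to copy, leave the element untouched
        real = found.group(1)
        if SRC.search(tag):
            # Overwrite the placeholder src with the real URL.
            return SRC.sub(lambda m: f'src="{real}"', tag, count=1)
        # No src at all: add one right after "<img".
        return tag[:4] + f' src="{real}"' + tag[4:]
    return IMG_TAG.sub(fix, html)


print(promote_data_src('<img data-src="https://example.com/a.jpg" src="placeholder.gif">'))
```

If the target site keeps the URL in a different attribute, the `DATA_SRC` pattern would need to change accordingly.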

I changed the domain and imported the settings, but it does not work on the new domain: https://ibb.co/PZgmxTW. The old domain still works.

Hi,

The target website might be blocking your new server, since it sends “503 Service Unavailable” response. You might be able to access it by using a proxy or changing the request headers such as HTTP User Agent header. In any case, the issue is not related to the plugin, because the plugin successfully makes a request to the target site.
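To show what changing the User Agent header means at the HTTP level, here is a minimal Python sketch using only the standard library. The URL and the User-Agent string are placeholders, the helper name `build_request` is hypothetical, and there is no guarantee that the target site will accept the request afterwards.

```python
import urllib.request


def build_request(url, user_agent):
    """Create a GET request that carries a custom User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


req = build_request("https://example.com/", "Mozilla/5.0 (compatible; ExampleBot/1.0)")
# urllib stores header names capitalized, so the key is "User-agent".
print(req.get_header("User-agent"))
# To actually send the request: urllib.request.urlopen(req, timeout=10)
```

If the site still answers with 503 after the header is changed, the block is probably based on the server's IP address, in which case only a proxy would help.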

I use tabs: https://ibb.co/fvBfG4V. All the content is retrieved, but it is not placed in the tabs.

Hi,

If the tab’s contents are created via JavaScript, the plugin cannot retrieve them. For more information, you can see this page.

I retrieved the content. On my site, I want to post the content in tabs, but when I use the “tester”, it does not split the content into tabs. I also tried posting manually with the “tool”, and that does not work either.

The tabs are created via JavaScript. You should configure your theme to convert the contents into tabs. You cannot create the tabs via the plugin.

I have this code in the HTML:

Tags: <a href="http://redsvn.net/tag/nhat-ban/" rel="tag">Nhật Bản</a>, <a href="http://redsvn.net/tag/the-chien-2/" rel="tag">Thế chiến II</a><br /><br /> <div id="fb-root" />

It is shown on the page like this: https://ibb.co/zsh4zGj And I get the tag names with this setting: https://ibb.co/ysx2mfG Now, when I remove this code in raw HTML or in HTML at first load, the tag names cannot be crawled: https://ibb.co/4fjvfL2 How can I remove this code and keep the tag names?

Thank you

P.S. I replaced this code with <!-Tags…-> but the tag names still cannot be retrieved.

Hi,

You can use Templates Tab > Manipulate HTML Section > Find and replace in post’s content setting to make the replacement. The rules defined in this setting are executed after the tags are retrieved. You can see when exactly each setting is applied here.

I tried the demo, how do I search and replace in text-only paragraphs?

Hi,

You cannot directly find and replace in text-only paragraphs, unfortunately. However, you can find and replace in the element that contains the paragraph by using, for example, Post Tab > Manipulate HTML Section > Find and replace in element HTML setting.

Hello, I love the plugin, thanks!

I am having a couple of small issues though and wondering if you can help… I’m using the plugin on a child site to crawl a category on a parent site. I have successfully managed to get the crawl to work and post the posts in the category on the parent site to the child site. However:

1) The featured image quality is low
2) The Comments do not work on the new post (despite the tick box in General settings -> Posts being clicked)

Any ideas?

Thanks.

amazing, so just sit back and wait. thanks again!! have a great day!

Thanks, you too.

If the recrawling interval is too long, you can also manually recrawl them by using Tools Page > Manual Recrawling Tab.

Hello turgutsaricam, I would like to use your great plugin. I tried it a while ago on a previous server, but I have already deleted it. Now I would like to use it on a new domain, but it won’t let me. It says: “This license has reached its domain limit and is not valid for this domain. Registered domains: hosting136592.a2e49.netcup.net”. How can I move it again? Many thanks, Tamas

Hi,

Could you please send your purchase code via the contact form on my profile page so that I can remove the registered domain from your license?

Hello, why are the posts stuck in the queue, and why can they not be saved?

Hi,

You can see this page to troubleshoot the automatic crawling problems.

Hi, are you sure that the new MS Translate API (portal.azure.com) is working fine? The Google API works fine, but not Microsoft:

Message: Exception
Details: Response could not be retrieved from Microsoft Translator Text API with https://api.cognitive.microsofttranslator.com/translate?api-version=3.0&from=es&to=de&textType=html - file_get_contents(https://api.cognitive.microsofttranslator.com/translate?api-version=3.0&from=es&to=de&textType=html): failed to open stream: HTTP request failed! HTTP/1.1 401 Unauthorized (2)
Type: error

Hi,

Because the plugin does not have an option that you can use to specify a region, when creating the API key on Microsoft Azure Portal, please make sure you create a global API key, not a region-specific one.
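As background, and as far as I understand Microsoft's Translator v3 REST API, a request authenticates with the `Ocp-Apim-Subscription-Key` header, and a key that belongs to a regional resource additionally needs the `Ocp-Apim-Subscription-Region` header. The Python sketch below only illustrates that difference. It is not the plugin's code, and the helper name `translator_headers` and the key and region values are placeholders.

```python
def translator_headers(api_key, region=None):
    """Build request headers for a Translator v3 call.

    Pass a region only for a region-specific resource; a global key needs none.
    """
    headers = {
        "Ocp-Apim-Subscription-Key": api_key,
        "Content-Type": "application/json",
    }
    if region is not None:
        # Regional resources expect the region to be sent along with the key.
        headers["Ocp-Apim-Subscription-Region"] = region
    return headers


print(translator_headers("YOUR_KEY"))                # global key
print(translator_headers("YOUR_KEY", "westeurope"))  # regional key
```

A client that can only send the first form, with no region header, would likely get 401 Unauthorized when given a regional key, which matches the error above.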

Ah okay, I set it to global:

{code provided location ‘global’ is not available for resource group. List of available regions is ‘centralus,eastasia,southeastasia,eastus,eastus2,westus,westus2,northcentralus,southcentralus,westcentralus,northeurope,westeurope,japaneast,japanwest,brazilsouth,australiasoutheast,australiaeast,westindia,southindia,centralindia,canadacentral,canadaeast,uksouth,ukwest,koreacentral,koreasouth,francecentral,southafricanorth,uaenorth,australiacentral,switzerlandnorth,germanywestcentral,norwayeast,jioindiawest,westus3,qatarcentral,swedencentral,australiacentral2’.”}

I see the “global” region when creating a translator, as can be seen in this screenshot. If you have trouble finding it, you can contact the support of Microsoft Azure.

I want to transfer license of the domain luxurynuochoa.com to subpok.com.

Have you tried making that replacement in Manipulate HTML Section > Find and replace in raw HTML setting?

I tried https://ibb.co/PrBjYRM but it does not work.

There is a new line after the opening tag of the script element. Your find-replace rule does not account for that. That’s why your replacement does not work. Maybe you can make the replacement by using regular expressions like this:

Regex: Checked
Find: (<script)(>[\s\S]+window.__PRELOADED_STATE__)
Replace: $1 id="my-id" $2

It works for the case where only a single script element exists, which can be seen here. The regex might not work in your case, though. If it does not work, you can change the regex accordingly. If you need to know more about what regular expressions are, you can see the documentation of find-replace setting.
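The same rule can be tried outside the plugin with a short Python sketch. The sample HTML is made up for illustration, and Python writes the `$1` and `$2` backreferences as `\1` and `\2`.

```python
import re

# Made-up sample: note the new line right after the opening script tag.
html = '<script>\n  window.__PRELOADED_STATE__ = {"a": 1};\n</script>'

# A literal find fails, because the text does not follow "<script>" directly.
print("<script>window.__PRELOADED_STATE__" in html)

# [\s\S]+ matches any characters, including new lines.
pattern = r"(<script)(>[\s\S]+window.__PRELOADED_STATE__)"
result = re.sub(pattern, r'\1 id="my-id" \2', html)
print(result)
```

Because `[\s\S]+` is greedy, on a page with several script elements the match starts at the first `<script` and runs to the last `window.__PRELOADED_STATE__`, so the id would be added to the first script element of the page. That is why the rule is only safe when a single script element exists.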

Can you help me change the active domain? Thanks so much!

Hi,

Please see this FAQ.

Could you please give me your Skype or Telegram?

Hi,

Support is provided only from this comment section and via the contact form available on my profile page.
