Discussion on WP Content Crawler - Get content from almost any site, automatically!

By turgutsaricam

3,660 sales

turgutsaricam supports this item

Supported

This author's response time can be up to 5 business days.

2655 comments found.

arnlweb Purchased

about 3 years ago

How can I remove this code? After enabling the script remover option, the code is still showing. Please help.

');var c=function(){cf.showAsyncAd(opts)};if(typeof window.cf !== 'undefined')c();else{cf_async=!0;var r=document.createElement("script"),s=document.getElementsByTagName("script")[0];r.async=!0;r.src="//srv.clickfuse.com/showads/showad.js";r.readyState?r.onreadystatechange=function(){if("loaded"==r.readyState||"complete"==r.readyState)r.onreadystatechange=null,c()}:r.onload=c;s.parentNode.insertBefore(r,s)}; })();

turgutsaricam Author

about 3 years ago

Hi,

It happens when a piece of JavaScript code contains HTML tags in it. For example, consider the following “script” element:

<script>
  var x = '<div>Test</div>';
</script>

The “div” tag existing in the “script” element is a text, not an HTML element. However, because it is not currently possible to differentiate them via PHP, the plugin thinks it is an HTML element. To avoid this, you need to remove those “div” tags from the script element’s content. You can use Post Tab > Manipulate HTML Section > Find and replace in raw HTML setting to remove them. For example, to remove the above, you can add a find-replace rule like this:

Find: <div>Test</div>
Replace: (leave this empty to remove the found value)

To find the part that causes this in the target page’s source code, you can search for the partial JavaScript code you sent in the target page’s source code. Then, just before that code, you should see some HTML tags. Those are the tags that you need to remove by adding find-replace rules.
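As a rough illustration of the find-replace rule above, the following Python sketch applies a plain (non-regex) replacement to the raw page source before anything parses it. It only mimics the idea: the plugin itself is written in PHP, and the helper name `find_replace_raw_html` and the sample markup are assumptions made for this example.

```python
def find_replace_raw_html(raw_html: str, find: str, replace: str = "") -> str:
    """Apply a plain-text find-replace rule to raw HTML (hypothetical helper)."""
    return raw_html.replace(find, replace)


# Sample page source: the "div" inside the script is only text in a JS string.
raw_html = """<p>Post content</p>
<script>
  var x = '<div>Test</div>';
</script>"""

# Rule from the comment above: Find "<div>Test</div>", Replace with nothing.
cleaned = find_replace_raw_html(raw_html, "<div>Test</div>")
print(cleaned)
```

Once the HTML-looking text is gone from the script body, a parser no longer has anything inside the script that it could mistake for an element.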

arnlweb Purchased

about 3 years ago

thanks

arnlweb Purchased

about 3 years ago

I need help with the Post Slug (Permalink) Selectors setting. These are example URLs:

https://www.example.com/2022/04/new-morshum-lyrics-x-equals-to.html
https://www.example.com/2023/05/new-morshum-lyrics-x-equals-to.html
https://www.example.com/2023/08/new-morshum-lyrics-x-equals-to.html

I want to remove the “https://www.example.com/2022/04/”, “https://www.example.com/2023/05/”, and “https://www.example.com/2023/08/” parts and the “.html” part from the URL.

turgutsaricam Author

about 3 years ago

You can do it by using the find-replace options available in the options box of the Post Slug Selectors setting. You need to use regular expressions. You can find more information here.
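A minimal sketch of the kind of regular expression this calls for, written in Python so it can be tried outside the plugin. The pattern is an assumption made for illustration, not a rule taken from the plugin's documentation. Its syntax is also valid in PCRE, but whether delimiters or extra escaping are needed in the plugin's input field is not covered here, so test any rule before relying on it.

```python
import re

# Hypothetical pattern: matches the scheme, host, and /YYYY/MM/ prefix,
# or a trailing ".html". Both parts are replaced with an empty string.
SLUG_CLEANUP = re.compile(r"^https?://[^/]+/\d{4}/\d{2}/|\.html$")


def url_to_slug(url: str) -> str:
    """Strip the dated URL prefix and the .html suffix, leaving only the slug."""
    return SLUG_CLEANUP.sub("", url)


urls = [
    "https://www.example.com/2022/04/new-morshum-lyrics-x-equals-to.html",
    "https://www.example.com/2023/05/new-morshum-lyrics-x-equals-to.html",
    "https://www.example.com/2023/08/new-morshum-lyrics-x-equals-to.html",
]
slugs = [url_to_slug(u) for u in urls]
print(slugs)
```

Matching the year and month with `\d{4}` and `\d{2}` means one rule covers all three URLs instead of needing one rule per date.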

As a note, you do not have active support. Support is provided only to the customers with active support.

about 3 years ago

Hello again. A client already has the plugin on his website, and he asked me to scrape the site “flowhot[dot]cc”. I have already configured everything and it works correctly. The issue is that, for example, the “MP3” category currently has 189 pages (and counting), but I cannot get the plugin to scrape all 189 pages. It always stops after the first 3 pages. Each page has 250 posts. How can I scrape all 189 pages of a category?

screenshot1: https://i.imgur.com/kciXMU9.png

settings export: https://justpaste.it/77riy

site: flowhot[dot]cc

turgutsaricam Author

about 3 years ago

Hi,

It looks like the site sends a 404 response for page 4, as can be seen in this screenshot: https://ibb.co/DWWJwxt

about 3 years ago

It’s true, it shows 404, but the page does exist. Why could this be? https://i.imgur.com/Kab9cKk.png

turgutsaricam Author

about 3 years ago

The target site returns a 404 response along with the data. However, the plugin stops processing a request when its status code is 404. The response’s status code can be seen in the developer tools of the browser: https://ibb.co/j56QJCY It is not possible to retrieve the content in this case, because there is no setting to disable the behavior that stops crawling when the status code is 404.
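To make the situation concrete, here is a small self-contained Python sketch showing that a server can send a 404 status together with a full HTML body. The local test server, its handler, and the `/page/4` path are all made up for this example and have nothing to do with the plugin's code. A client that stops as soon as it sees 404 never looks at the body, which matches the behavior described above.

```python
import threading
import urllib.error
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class NotFoundWithBody(BaseHTTPRequestHandler):
    """Test server that answers every GET with status 404 and a real HTML body."""

    def do_GET(self):
        body = b"<html><body><a href='/post-1'>Post 1</a></body></html>"
        self.send_response(404)
        self.send_header("Content-Type", "text/html")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the example output quiet


# An opener with an empty ProxyHandler ignores proxy settings from the environment.
opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))


def fetch(url: str):
    """Return (status, body). urllib raises HTTPError on 404, yet the body stays readable."""
    try:
        with opener.open(url) as resp:
            return resp.status, resp.read()
    except urllib.error.HTTPError as err:
        return err.code, err.read()


server = HTTPServer(("127.0.0.1", 0), NotFoundWithBody)  # port 0: use any free port
threading.Thread(target=server.serve_forever, daemon=True).start()

status, body = fetch(f"http://127.0.0.1:{server.server_port}/page/4")
server.shutdown()
server.server_close()

print(status, len(body))
```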

about 3 years ago

You are right, it is effectively an error on that site. I tried an online SEO checker with the link, and it also reported 404. I will use another similar site to scrape instead. Thank you so much.

about 3 years ago

Hello, I have seen in some YouTube videos that Automatic can translate through several languages in sequence, for example English -> Spanish -> Russian and then back to English, in order to produce text that is not similar to the original. Is this possible with your plugin? Can it translate the content through several languages sequentially?

turgutsaricam Author

about 3 years ago

Hi,

The plugin is currently not capable of doing that, unfortunately.

about 3 years ago

Do you think that in a future version you could simply add an ‘Add new language’ option that allows adding more than one language, so that the Google API can translate through them? This would be for the title and content, which is easy to apply, but I don’t know how you could apply it to the rest of the data. Perhaps by marking which of the data selected in the configuration should be translated.

You can check on Youtube, so it is now scraper the data

turgutsaricam Author

about 3 years ago

I added this as a feature request, thanks. I could not get what you mean in the YouTube part, though. Could you please elaborate on the last part?

about 3 years ago

Of course. I mean that I have seen on YouTube that people use the WP Automatic plugin, and they show how to translate the text into other languages more than once so that it comes out different. For example, if you translate it from Spanish to English, then to Turkish, then to German, and then back to Spanish, the text will differ from the original Spanish, which makes it more original. I am attaching the video link. Also, your plugin looks more versatile, so the translation feature should be very clear; I suppose you will add it in a way that is easy to understand and better than WP Automatic.

https://www.youtube.com/watch?v=91oSYiesvQs https://i.imgur.com/Npiheg5.png

about 3 years ago

The translation should be applied per element. For example, it is not necessary to translate words such as “Tags” or “Category”, or the name of an artist of a selected item, several times, but it could be necessary for the content text and the title. My idea is that the translation section lists all the selected elements, and each element gets its own list of languages to translate through. Then only the chosen elements are translated, and each one is translated as many times as there are languages added for it. This could be shown as a matrix so that the option is easy to read.
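The per-element idea in this request can be sketched roughly as follows. This is only an illustration of the proposal, not an existing plugin feature: the function names, the field names, and the `fake_translate` stand-in (which records each hop instead of calling a real translation API) are all hypothetical.

```python
from typing import Callable, Dict, List

# A translator takes (text, source_lang, target_lang) and returns translated text.
Translator = Callable[[str, str, str], str]


def translate_chain(text: str, langs: List[str], translate: Translator) -> str:
    """Translate text through consecutive language pairs, e.g. es -> en -> tr -> es."""
    for source, target in zip(langs, langs[1:]):
        text = translate(text, source, target)
    return text


def translate_fields(fields: Dict[str, str], chains: Dict[str, List[str]],
                     translate: Translator) -> Dict[str, str]:
    """Apply each field's own language chain; fields without a chain stay untouched."""
    return {
        name: translate_chain(value, chains[name], translate) if name in chains else value
        for name, value in fields.items()
    }


def fake_translate(text: str, source: str, target: str) -> str:
    """Stand-in for a real translation API call; it only records the hop."""
    return f"{text} [{source}->{target}]"


post = {"title": "Hola", "content": "Texto", "tags": "Rock"}
# The "matrix": one row per element, one column per language in its chain.
chains = {"title": ["es", "en", "es"], "content": ["es", "en", "tr", "de", "es"]}
result = translate_fields(post, chains, fake_translate)
print(result)
```

Keeping the chain per field means values such as tags or artist names can simply be left out of `chains` and pass through unchanged.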

turgutsaricam Author

about 3 years ago

All right, I think I got it, thanks. I think adding this feature as a filter command is more suitable. It might not be added in the next version, but I plan to add it after that.

about 3 years ago

Yes, I think the same. You could add it within the filters, with the possibility of translating several (X) times or only once.

Great, thanks for your attention. I’ll be waiting for this version.

about 3 years ago

Hi, is it possible to scrape the different seller prices on this page and to insert them in custom fields? https://www.hind.ee/p/apple-iphone-13/

turgutsaricam Author

about 3 years ago

Hi,

It is not possible to get each price with their seller information separately and save them as custom fields, unfortunately.

about 3 years ago

(1) Can your plugin skip downloading the images in the category/post tab? (2) Can your plugin keep the source image URL for the crawled images?

I want to use the images from the source. I don’t want to save them on my server.

turgutsaricam Author

about 3 years ago

Hi,

If you do not specifically configure the plugin to save the images, it does not save the images by default. However, because WordPress cannot show a featured image via its external URL, the featured images are always downloaded.

about 3 years ago

thank you

basti4k Purchased

about 3 years ago

Please add DeepL for translation, along with DeepL Write, to avoid duplicate content.

turgutsaricam Author

about 3 years ago

Hi,

DeepL Translate will be in the new version. DeepL Write does not seem to be available in the API. However, I am working on OpenAI GPT integration.

basti4k Purchased

about 3 years ago

Sounds very good! When is the new version coming?

turgutsaricam Author

about 3 years ago

The date is not certain yet. But, it will probably be released in a month.

basti4k Purchased

about 3 years ago

perfect. I am very happy about this news

turgutsaricam Author

about 3 years ago

If you have use-case ideas about OpenAI GPT, please join the Discord server so that we can discuss the ideas there. I would like to know them so that I can design the feature to cover all the use-cases.

gracehuynh Purchased

about 3 years ago

The plugin crawls too slowly (with the “save all content images” option unchecked). Is there any way to improve it? Thanks.

turgutsaricam Author

about 3 years ago

Hi,

If you want to save more posts in a specific time interval, you can increase the values of General Settings > Scheduling Tab > Run count for URL collection event and General Settings > Scheduling Tab > Run count for post crawling event settings. If crawling a single post takes too much time, it is probably related to the speed of the target site.

gracehuynh Purchased

about 3 years ago

Thanks. By the way, what value do you suggest for this option?

turgutsaricam Author

about 3 years ago

It depends on your needs and your server’s capacity. You can start by entering 2 and then observe if your server can handle it. Then you can gradually increase the value until you find the sweet spot.

andienhuynhngoc01 Purchased

about 3 years ago

Hi, I cannot crawl with your tool.

turgutsaricam Author

about 3 years ago

Hi,

Could you please tell me what exactly the problem is?

andienhuynhngoc01 Purchased

about 3 years ago

I activated it a long time ago, but nothing happened. I can only crawl posts one at a time.

turgutsaricam Author

about 3 years ago

In that case, the settings under the Category tab might not be able to find post URLs. For the automatic crawling to work, the plugin needs to be able to find post URLs from category pages. How to configure and test the settings under the Category tab is explained here.

Tuncer-Karatastan Purchased

about 3 years ago

I moved my WP site to a subdomain, but the plugin stopped fetching data. What do I need to do?

turgutsaricam Author

about 3 years ago

Hi,

If you are getting an error saying that your license is registered to another domain, please send me your license key using the contact form on my profile page so that I can remove your old domain from your license. If there is a different problem, you can try to find a solution by reviewing this page: https://docs.wpcontentcrawler.com/troubleshooting/index.html

about 3 years ago

I want to copy all the content of a WP, but I think they are pages, not posts. Is it possible?

I can’t fill in any category because the posts have no category. I have tried to use the demo without success.

turgutsaricam Author

about 3 years ago

Hi,

It is possible to create pages with the plugin. A page is a custom post type. How to save custom post types is explained here.

About the category, a category is a page that contains the links of the posts. So, URL of any page that contains the links to the pages you want to save can be used as a category URL.
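The point that any page containing links can act as a “category” can be illustrated with a short Python sketch. The listing markup, the base URL, and the `LinkCollector` class are assumptions made for this example. The plugin itself is configured with selectors on the Category tab, whereas this sketch simply collects every link on the page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute URLs from every <a href="..."> on a page."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page they were found on.
                self.urls.append(urljoin(self.base_url, href))


# Hypothetical "category" page: any page that links to the pages to be saved.
listing_html = """
<ul>
  <li><a href="/about/">About</a></li>
  <li><a href="/services/">Services</a></li>
</ul>
"""

collector = LinkCollector("https://www.example.com/sitemap/")
collector.feed(listing_html)
print(collector.urls)
```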

about 3 years ago

Thank you.

You say I can crawl a post and turn it into a page on my website.

But I don’t need to crawl posts, I need to crawl pages. Is that possible? The pages have no category.

I want to crawl pages and copy them to my WP site.

I am using the demo, but I am not able to do it.

turgutsaricam Author

about 3 years ago

You can select a dummy category in Category URLs setting, because it does not matter for pages, if your concern is this. So, yes, it is possible. You can follow the instructions on the page whose link I sent in my previous comment to create pages.

georgetr1 Purchased

about 3 years ago

I have crawled content that is minified into a single line. How can I make each paragraph (p tag) start on a new line? Thanks.

turgutsaricam Author

about 3 years ago

Hi,

It does not seem to be possible, unfortunately. However, after the post is created, you should be able to see the unminified version when editing the post, if that’s your purpose.

georgetr1 Purchased

about 3 years ago

Hi,

Does “SpinRewriter” function rewrite the content and title or only content?

turgutsaricam Author

about 3 years ago

Hi,

You can select which parts of the posts should be spun via Main Tab > Spinnable Fields setting, which becomes visible when you enable Main Tab > Active for post spinning setting.

georgetr1 Purchased

about 3 years ago

Thank you for your feedback

georgetr1 Purchased

about 3 years ago

Is it possible to exclude tags from spinning? For example, I have clean HTML content without any additional attributes like id, classes, etc. But sometimes, after SpinRewriter runs, I end up with code that has missing tags.

Is there any solution to this problem?

turgutsaricam Author

about 3 years ago

It looks like it is a problem related to SpinRewriter. Maybe, in your post content selectors, you can retrieve the “text” of the target elements instead of “html”, which will make the content text-only so that SpinRewriter does not have an opportunity to break the HTML structure.

smile4sammy Purchased

about 3 years ago

Hello, kindly reset my domain name so I can attach a new domain with my license. I do not have access to current domain showing on my license since I was unable to renew it.

Kindly reset domain to blank so I can start afresh

turgutsaricam Author

about 3 years ago

Hi,

Please send your purchase code via the contact form on my profile page so that I can remove the domain registered to your license.

about 3 years ago

Does wp content crawler upload the image twice, if it is in two posts?

turgutsaricam Author

about 3 years ago

Hi,

Yes, it does.

about 3 years ago

Your plugin is really very complete. I did some tests on your server and it is very promising. Maybe you already have this option, but I didn’t see it. I would like to use a Custom Post Meta as the caption of the Featured Image. Is it possible? Perhaps there is a better way to do this, but an external site placed the photo’s caption in <figcaption></figcaption>, and it was not automatically saved as the caption of the featured image.

turgutsaricam Author

about 3 years ago

Hi,

Thank you. The plugin does not retrieve the captions from figcaption elements, unfortunately. However, it automatically retrieves “alt” and “title” attributes’ values from the elements found by the image URL selectors. If you can use find-replace options to put the text of the figcaption element into the “alt” or “title” attribute of the image element, it should be automatically saved with the image.

If what you want to do is save the text of a figcaption element as the value of a custom post meta whose key is known by you, you can directly use Post Tab > Post Meta Section > Custom Meta Selectors setting to save it. However, this will not associate the value with the saved image, unfortunately.

about 3 years ago

Nice. Which find-replace would I use to do this? Because there is this option in several places.

turgutsaricam Author

about 3 years ago

Under Post Tab > Manipulate HTML Section, you can either use Find and replace in raw HTML or Find and replace in element HTML. Please note that you need to use regular expressions. If you do not know how to use them, you can use a regular expression like this. As a replacement, you can write something like this:

<figure><img title="$2" /></figure>

Please note that the plugin’s support does not cover this type of customization. I am sending this just as an example. The example replacement rule also removes the figcaption element. This regex might not work for the code you have.
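Since the linked regular expression is not reproduced in this thread, here is a hypothetical stand-in in Python that shows the general technique of moving figcaption text into a “title” attribute. The pattern, the sample markup, and the replacement (which, as an assumption, also carries over the image’s “src”) are made up for this example and are not the rule from the comment above. Python writes group references as `\g<2>`, whereas the replacement above uses the `$2` form.

```python
import re

# Hypothetical pattern (not the one linked in the comment above):
#   group 1 = value of the img "src" attribute, group 2 = figcaption text.
# It assumes "src" is the first attribute of the img element.
FIGURE_RE = re.compile(
    r'<figure>\s*<img\s+src="([^"]*)"[^>]*>\s*<figcaption>(.*?)</figcaption>\s*</figure>',
    re.DOTALL,
)


def caption_to_title(html: str) -> str:
    """Rewrite each matching figure so the caption text becomes the img title."""
    return FIGURE_RE.sub(r'<figure><img src="\g<1>" title="\g<2>" /></figure>', html)


html = '<figure><img src="photo.jpg" alt=""><figcaption>Sunset over the bay</figcaption></figure>'
converted = caption_to_title(html)
print(converted)
```

A caption that itself contains double quotes or nested tags would produce a broken attribute, so a rule like this needs to be checked against the actual markup of the target site.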

c4glr Purchased

about 3 years ago

Hi, the URL collection part of the plugin works properly, but I am having a problem with saving posts. Sometimes saving does not run at all, and sometimes, even though I selected saving every minute, posts are saved every 5 minutes or at 1-hour intervals. I requested the cron-related logs from the server side. There are warnings like the ones below. What do I need to do?

2023-01-24 23:34:51 Warning IP Address mod_fcgid: stderr: #1 /var/www/vhosts/website.com/httpdocs/wp-content/plugins/wp-content-crawler/app/Objects/Crawling/Savers/PostSaver.php(207): WPCCrawler\Objects\Crawling\Savers\PostSaver->savePost(22595, Array, false, true, false, false, false), referer: https://website.com/wp-cron.php?doing_wp_cron=1674592484.0154 Apache error

2023-01-24 23:34:51 Warning IP Address mod_fcgid: stderr: #2 /var/www/vhosts/website.com/httpdocs/wp-content/plugins/wp-content-crawler/app/Services/SchedulingService.php(167): WPCCrawler\Objects\Crawling\Savers\PostSaver->executePostSave(22595), referer: https://website.com/wp-cron.php?doing_wp_cron=1674592484.0154 Apache error

Thank you.

turgutsaricam Author

about 3 years ago

Hi,

Could you also send the details of the logs? If I cannot see exactly where the error originates, it is not possible for me to help you.

c4glr Purchased

about 3 years ago

Hi, I think the WordPress crons are not working properly, and I cannot see anything in a more detailed log. Instead, I reviewed the link https://www.hostgator.com/help/article/how-to-replace-wordpress-cron-with-a-real-cron-job that you previously recommended to other people. If I disable the WordPress crons and add a cron via cPanel, which of the plugin’s crons do I need to call from the server?

turgutsaricam Author

about 3 years ago

Hi,

You do not need to do anything specific to the plugin. As mentioned in the link, triggering WordPress’s main WP-Cron is enough.

c4glr Purchased

about 3 years ago

Hi, although I activated the license, this warning appears at certain intervals. I activate the license again, but after a while the warning keeps coming back. Can you help?

The license key you entered for Content Crawler is not valid or could not be checked. To continue using Content Crawler, please enter your license by 20/01/2023 23:57.

Message: Your license could not be checked with the server. Please try saving your license settings again in a few minutes. If you keep seeing the warning, please contact the developer.

Number of attempts remaining before the features are disabled: 2

turgutsaricam Author

about 3 years ago

Hi,

There can be many reasons for this. Your site might be sending too many requests to the plugin’s license server. In such cases, the license server temporarily blocks your server’s IP address. This is very unlikely if you have not modified the plugin’s files, because the plugin does not make requests to the license server very often. However, if many requests are being sent for some reason, that may be why you get this error from time to time. Another possible reason is that, if a caching plugin is installed on your site, that plugin might be returning to WordPress the response of a request previously made to the license server. As another cause, your site’s server might be under too much load and unable to complete the request made for the license check. Lastly, your server might be having trouble connecting to the plugin’s server due to some kind of connection problem. It is not possible to fix these on the plugin side, because these are not problems caused by the plugin. You can try to examine and resolve the causes I mentioned one by one.

c4glr Purchased

about 3 years ago

Thank you very much for your quick reply. I will check the server side.

I have one last question. I save the content I fetch with your plugin to my website as WooCommerce products. However, I want to show the thumbnail image of the fetched product via an external link, without saving it to my own server. Is this possible?

With the EXMAGE – WordPress Image Links plugin, I can define external product image links myself when adding products manually. Would it be possible to define an external product image this way with your plugin as well?

turgutsaricam Author

about 3 years ago

If the plugin named EXMAGE stores the featured image’s URL in the post’s custom fields or as a taxonomy value, you can save these values with the plugin. You can learn how to save these values from the links below:

To save custom field values: https://docs.wpcontentcrawler.com/guides/saving-post-meta-custom-fields.html

To save taxonomy values: https://docs.wpcontentcrawler.com/guides/saving-taxonomy-values.html

c4glr Purchased

about 3 years ago

Hi, I checked the EXMAGE plugin’s meta values in the database, defined the same ones in your plugin, and filled them in for the relevant product content. When I checked the database, I saw that these records were saved successfully, but the product images are not displayed. When I check the products one by one, no external link is defined for them.

Because I defined the metas, I had disabled the option to save product images on both the category and post sides. Although the meta values are filled with the external links in the database, why are the product images empty? Is there something I am missing? The issue may not be related to your plugin, but I would be very glad if you could help.

When I manually define product images as external links with EXMAGE, I see the relevant link in the database in 2 meta records. I save the meta records the same way with your plugin. These are the _wp_attached_file and _exmage_external_url values. Database screenshot -> https://ibb.co/Mp2CSbk

The definitions I made in your plugin for the meta values are as follows -> https://ibb.co/b66KxKN

Thank you.

turgutsaricam Author

about 3 years ago

Could you try again after checking the Post Tab > Other Section > Trigger save_post hook option?

c4glr Purchased

about 3 years ago

Hi, I tried it, but the product images are still not displayed. The records have been saved to the database. However, I looked at the products one by one, and the link does not appear to be saved.

turgutsaricam Author

about 3 years ago

In that case, unfortunately, there is nothing extra you can do on the plugin side. Perhaps you also need to save another custom field value.

about 3 years ago

How can I replace the featured image URL before importing?

Featured image on the category page: https://fancy4talk.com/wp-content/uploads/2023/01/xxxx-300x300.jpg
The real featured image: https://fancy4talk.com/wp-content/uploads/2023/01/xxxx.jpg

Can I remove “-300x300” before getting the featured image? Thanks.

turgutsaricam Author

about 3 years ago

Hi,

Yes, you can do it by using Post Tab > Featured Image Section > Find and replace in featured image URL setting. To remove that part, you can configure it like this:

Regex: Unchecked
Find: -300x300
Replace: [leave this input empty]

For more information on how to use the find-replace settings, you can see its documentation.
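The effect of the rule above can be checked with a small Python sketch. The first function mirrors the plain, non-regex rule as given. The second is an assumed generalization using a regular expression for any “-WIDTHxHEIGHT” suffix placed just before the file extension; it is not something the thread prescribes, and both function names are made up for this example.

```python
import re


def strip_fixed_suffix(url: str) -> str:
    """Plain find-replace, as in the rule above: Find '-300x300', Replace ''."""
    return url.replace("-300x300", "")


# Assumed generalization: a "-<width>x<height>" part just before the extension.
SIZE_SUFFIX = re.compile(r"-\d+x\d+(?=\.[A-Za-z0-9]+$)")


def strip_any_size_suffix(url: str) -> str:
    """Remove any size suffix such as -150x150 or -1024x768 before the extension."""
    return SIZE_SUFFIX.sub("", url)


thumb = "https://fancy4talk.com/wp-content/uploads/2023/01/xxxx-300x300.jpg"
full = strip_fixed_suffix(thumb)
other = strip_any_size_suffix(
    "https://fancy4talk.com/wp-content/uploads/2023/01/xxxx-150x150.jpg"
)
print(full)
print(other)
```

The plain rule is enough when every thumbnail uses the same size. The regex variant only becomes useful if the target site mixes several thumbnail sizes.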

about 3 years ago

I get the featured image from the category page. Is it the same?

turgutsaricam Author

about 3 years ago

Yes, it is. You need to use the setting that is available under the Category Tab. You can simply configure the setting and use the Tester page to see if your configuration works as expected.

about 3 years ago

I tried, but it didn’t work. On the Tester page, I can’t see the featured image value, only the title and content.

turgutsaricam Author

about 3 years ago

You should test a category page. The post page tests do not show the featured images collected from the category pages. If the featured images are available in the post pages, I recommend you to retrieve them from the post pages instead of the category pages.

about 3 years ago

I am facing this problem:

[17-Jan-2023 17:07:45 UTC] WPCC – No URL is found in the database. Site ID to check: 220, Last Crawled URL ID: does not exist
[17-Jan-2023 17:07:45 UTC] WPCC (error): URL does not exist in the database.

turgutsaricam Author

about 3 years ago

It means there are no post URLs waiting to be crawled in the database. Your category settings might not be correct, or all the posts waiting in the queue are already crawled.

about 3 years ago

I bought 2 licenses by mistake. Can I get a refund for one of them? Thanks!

turgutsaricam Author

about 3 years ago

Hi,

Could you please send a refund request through CodeCanyon?

mj2385 Purchased

about 3 years ago

Which email should I send a support request to in order to change the domain? I don’t have access to the previous one, and I’m assuming it’s not advisable to post my purchase code here.

turgutsaricam Author

about 3 years ago

Hi,

You can use the contact form on my profile page.

mj2385 Purchased

about 3 years ago

sent. thanks

This item was featured on CodeCanyon

Elite Author

turgutsaricam

Last Update	7 September 2024
Gutenberg Optimized	Yes
Compatible Browsers	Firefox, Safari, Opera, Chrome, Edge
Compatible With	WooCommerce 8.x.x, WooCommerce 7.x.x
Files Included	JavaScript JS, CSS, PHP
Software Version	WordPress 6.5.x, WordPress 6.4.x, WordPress 6.3.x, WordPress 6.2.x, WordPress 6.1.x, WordPress 6.0.x, WordPress 5.9.x
Tags	ai content, autoblog, automatic, automatically, bot, chatgpt, content, post, product, programmatically, scheduling, scrape, scraper, woocommerce
Created	10 years ago

© All Rights Reserved turgutsaricam