Discussion on WP Content Crawler - Get content from almost any site, automatically! (Page 92)

turgutsaricam supports this item

Supported

This author's response time can be up to 5 business days.

2655 comments found.

theallegrarr

about 7 years ago

can i crawl cnn.com for articles with your crawler?

turgutsaricam Author

about 7 years ago

Hi,

Yes, you can. However, you need to remove some text from the page that cause HTML to be parsed incorrectly. You can go to “Manipulate HTML > Find and replace in raw HTML” and configure it as follows:

Regex: Checked
Find: <script[^>]*>.*?<\/script>
Replace: Leave this input empty

telechargertn

about 7 years ago

Hello; I would like to ask a question, does the plugin allow to extract tags defined example X1, X2, etc. in a page www.website.com/content-page1 and created from these fields a user with fields X1, X2, etc..

turgutsaricam Author

about 7 years ago

Hi,

The plugin does not have a feature that can be used to create users, unfortunately

telechargertn

about 7 years ago

Ok can you please, give me an example to extract a page content from Wikipedia https://fr.wikipedia.org/wiki/Banco_Comercial_Portugu%C3%AAs without the plugin extracting the side table . Is it possible? I tried the demo but I can’t extract just the text and titles from this page.

turgutsaricam Author

about 7 years ago

If you cannot select elements directly, i.e. without the elements you do not want, you can remove the unwanted elements by entering their selectors into “unnecessary elements selectors” option.

gertsnijder Purchased

about 7 years ago

Hi, I bought your plugin! After try to use Scraper en Scrapes, your plugin is the best with the most functions!!

I have 2 questions,

1. i crawl sales products from some webshops. I need an option to get an input for an custom field. The field is an end date off sale coupon.

I need to generate: “Crawl date + 7 days” for example. Do you have any idea for this to fix. Output must be 2019-03-08.

2, I try to make title off product with first letter capital i found next: functx:capitalize-first(‘[wcc-main-title]’)

This is not working, is there an way to get that working also?

2 other replies

turgutsaricam Author

about 7 years ago

Hi,

Thanks, I’m glad you like the plugin.

1. There is currently no functionality that can be used to perform calculations on dates, unfortunately.

2. There is no easy way to capitalize the first letter. However, you can use find-replace options with regular expressions to change the first letter with anything. In your case, you need to add find-replace options for every possible character. For example, if you enter “^a” for find input and enter “A” for replace input by making sure that regex checkbox is checked, lowercase “a” will be replaced by uppercase “A”. You can add all characters existing in your alphabet in this format to make the first letter uppercase.

gertsnijder Purchased

about 7 years ago

Hi,

1. Is there a shortcode or something to display last crawl date? That function I find in Scrapes.

2. Ahhw than it’s better to do it with CSS in the theme.

turgutsaricam Author

about 7 years ago

There is no function to show date, unfortunately. Could you please tell me what you mean by “last crawl date”? I will add this as a feature request but I need to understand what you mean exactly.

gertsnijder Purchased

about 7 years ago

I mean the current date. Date off scraping.

turgutsaricam Author

about 7 years ago

I added this as a feature request. Thanks.

Globalh Purchased

about 7 years ago

Merhaba 3 ay önce ürünü aldım. Bugün lisans süresinin dolduğunu belirten bir mesaj gördüm. 6 aylık bir destek uzatması aldım fakat aynı uyarı mesajı hala durmakta. Ne yapmam gerekir.

4 other replies

turgutsaricam Author

about 7 years ago

Merhaba,

Lisansınız ömür boyu geçerlidir. Desteği uzatmanız lisansınızı herhangi bir şekilde etkilemeyecektir. Aldığınız uyarı mesajında mesaj bölümünde ne yazıyor acaba?

Globalh Purchased

about 7 years ago

İçerik Toplayıcı lisansınızın süresi doldu. İçerik Toplayıcı kullanmaya devam etmek için lütfen 04/03/2019 20:47 tarihine kadar yeni bir lisans edinin.

Mesaj: This license has reached its domain limit and is not valid for this domain. Registered domains: www.**.com

Alan adını ben sildim.

turgutsaricam Author

about 7 years ago

Aynı lisansla eklentiyi sadece bir domainde aktif olarak kullanabilirsiniz. Diğer domaininizde eklentiyi pasif hale getirip lisans ayarlarını tekrar kaydettiğinizde mesaj kaybolacaktır. Eğer sildiğiniz domain ile eklentinin yüklü olduğu domain aynıysa, sorunu inceleyebilmem için lütfen admin panel ve FTP giriş bilgilerinizi profil sayfamdaki iletişim formunu kullanarak gönderin.

Globalh Purchased

about 7 years ago

Evet domain aynı fakat dün başka bir serverdan taşıdım ve inaktif etmeden eski serverı sildim.

turgutsaricam Author

about 7 years ago

Profil sayfamdaki iletişim formunu kullanarak lisans anahtarınızı gönderebilir misiniz? Kayıtlı domaininizi sileyim, lisans ayarlarınızı tekrar kaydetmeyi deneyin.

Globalh Purchased

about 7 years ago

Yardımınız için teşekkürler, düzeldi.

turgutsaricam Author

about 7 years ago

Rica ederim, düzelmesine sevindim.

teknoemre Purchased

about 7 years ago

Merhaba, siteyi test et butonuna bastığım zaman

An error occurred. 502 Bad Gateway The server returned an invalid or incomplete response.

hatası alıyorum ne yapmam gerek acil yardımcı olursanız çok sevinirim, teşekkürler.

13 other replies

teknoemre Purchased

about 7 years ago

sürekli 502 Bad Gateway The server returned an invalid or incomplete response. hatası alıyorum lütfen yardımcı olursanız çok sevinirim

teknoemre Purchased

about 7 years ago

hatta siteyi çökertiyor

turgutsaricam Author

about 7 years ago

Merhaba,

Bu hatayı nerede alıyorsunuz?

teknoemre Purchased

about 7 years ago

yeni site eklerken yani kategori testini yaparken

teknoemre Purchased

about 7 years ago

css testi

teknoemre Purchased

about 7 years ago

daha sonrada site çöküyor bi süreliğine sonra tekrar düzeliyor

teknoemre Purchased

about 7 years ago

eklentiyi satın aldım ama daha kullanamadım bile çünkü sorun oluyor lütfen biran önce yardım edebilirseniz siteme içerik ekleyeceğim, teşekkürler..

turgutsaricam Author

about 7 years ago

Sorun sunucunuzla alakalı gibi görünüyor. Sorunun sebebini inceleyebilmem için admin panel ve FTP giriş bilgilerinizi buradaki profil sayfamda bulunan iletişim formunu kullanarak gönderebilir misiniz?

teknoemre Purchased

about 7 years ago

iletişim formunun linkini yollayabilirmisiniz hemen göndereyim sizde lütfen hemen inceleyin

teknoemre Purchased

about 7 years ago

bugün bu işi halledelim lütfen

turgutsaricam Author

about 7 years ago

İletişim formuna ulaşmak için nickime tıklayabilirsiniz.

teknoemre Purchased

about 7 years ago

tamamdır sftp bilgilerini yolladım sizden en kısa sürede geri dönüş bekliyorum teşekkürler

teknoemre Purchased

about 7 years ago

tüm bilgileri yolladım Turgut bey lütfen sorunu çözünce geri dönüş yaparsanız, sizden haber bekliyorum..

turgutsaricam Author

about 7 years ago

Size zaten dönüş yaptım. Sürekli hızlı bir dönüş beklediğinizi belirtmenize gerek yok.

teknoemre Purchased

about 7 years ago

Anlıyorum ama size yolladığım siteyi test ederken bende hep 502 Bad getaway hatası verdi

turgutsaricam Author

about 7 years ago

Hem burdan hem de mailden aynı şeyi neden yazdığınızı anlayamıyorum. Size her iki platformdan da aynı şeyleri söylemek tam olarak vakit kaybı. Ya buradan yazın, ya da mail atın lütfen.

telechargertn

about 7 years ago

hello, pre-purchase question, the plugin contains a field to put an featured image from the media? I want all articles to have a unique and predefined photo (photo media link) and I don’t want to download those presented in the article. is it possible?

turgutsaricam Author

about 7 years ago

Hi,

No, the plugin does not have that feature, unfortunately. You can maybe use find-replace options to add an img element into the target site’s source code and write a CSS selector that finds that image element as featured image URL. However, the plugin will still save it even if it exists in your site.

corpor8chic

about 7 years ago

Hello,

I have an RSS feed for my Deviant Art group, however the feed works fine but sometimes the Mature filter stops all the content from getting to my site I need a cookied session to be recorded to prevent this from happening can your plugin do this?

2 other replies

turgutsaricam Author

about 7 years ago

Hi,

Yes, you can use cookies when crawling a website using the plugin. You can assign the cookies under Main tab of site settings.

corpor8chic

about 7 years ago

Hello thank you for responding,

If I need help with configuring the feed with your plugin can you assist me with this..I currently have the feed running on Wordpress Automatic.

Thank you.

turgutsaricam Author

about 7 years ago

The plugin’s support does not cover configuration of settings, unfortunately.

corpor8chic

about 7 years ago

ah so what if I can’t get it working..then what?

turgutsaricam Author

about 7 years ago

In that case, if you think that the plugin has a problem, you can ask for a refund. If you do not know how to use a feature, you can ask how to use it.

njamal22

about 7 years ago

Hi,

I really think this plugin can solve my problem. However, I am not deep into programming and CSS selectors. Would you be available to help for a few sites in return for an extra donation? I want to crawl articles on 5-6 pages. I want to display the search results on those pages on my website. Much appreciated, Nihad

turgutsaricam Author

about 7 years ago

Hi,

Unfortunately, I do not provide configuration services even if money is involved.

majesti77

about 7 years ago

Hello, I need a plugin that allows me to add 100’s of urls for single product pages with the ability to add specific taxonomies to each individual url (product). I need a way to do this in bulk without having to do each url manually. This will be used for Woocommerce products and I would like to have it so that the image urls are not downloaded to my server. Is this possible with this plugin?

3 other replies

turgutsaricam Author

about 7 years ago

Hi,

Yes, it is possible.

majesti77

about 7 years ago

Thank you… Do you have a video that shows how its done?

turgutsaricam Author

about 7 years ago

You can check this video out: https://www.youtube.com/watch?v=2MYWzq_oAig It mentions the features you want to use. You can click the time marks available in the video description to watch only the parts you need.

majesti77

about 7 years ago

Thank you. Lastly, is it possible to import urls via wp all import or with CSV. files?

turgutsaricam Author

about 7 years ago

No, it is not possible to do so using them. However, you can use Tools page to manualy import URLs. You can enter new line separated URLs in manual crawling tool and click “add to database” button. After that, the plugin will crawl those URLs using scheduling options you defined.

majesti77

about 7 years ago

That sounds about perfect…. I will be purchasing tonight when I get home from work. thank you

rejek

about 7 years ago

Hello, I’m interested in your plugin.

can you make a tutorial for scrap UDEMY website?

turgutsaricam Author

about 7 years ago

Hi,

You can watch the tutorials available in this playlist to learn how to use the plugin.

tuncelhezer Purchased

about 7 years ago

turgut hocam bu bot kendını guncellıyor degılmı,extra bırsey yapmaya gerek varmı

3 other replies

turgutsaricam Author

about 7 years ago

Merhaba,

Eklenti kendi kendini güncellemez. Güncelleme olduğunda diğer eklentilerde olduğu gibi eklentiler sayfasında bir güncelleme mesajı görürsünüz. Güncelleme butonuna veya linkine tıkladığınızda güncellenir.

tuncelhezer Purchased

about 7 years ago

turgut hocam bır sorum daha var musadenlu,bot tek bır sayfadan yazı cekicek,1 sayfada 10 adet yazı var,bunlar kendılıgınden guncellenıyor,tek bır sayfa oldugu ıcın sonrakı sayfa ozellıgı yok , yani Kategori Sonraki Sayfa URL’si Seçicilerini kullanamıyorum,ben bu sayfada 10 adet linki araçlardan manuel eklesem , bot içerik güncellendiği zaman kendisi otomatik guncellermı

turgutsaricam Author

about 7 years ago

Evet, günceller.

tuncelhezer Purchased

about 7 years ago

turgut hocam guncellemıyor acaba kopya yazı olarak goruyor ondanmı,cunku başlık aynı yazıda

tuncelhezer Purchased

about 7 years ago

manuel olarak guncellıyor

turgutsaricam Author

about 7 years ago

Lütfen eklentinin genel ayarlar sayfasında bulunan yazı güncelleme ayarlarının doğru ayarlandığından emin olun. Diğer destek talepleriniz için lütfen satın aldığınız desteği güncelleyin.

tuncelhezer Purchased

about 7 years ago

merhabalar,yazı baslıgının sonuna günün tarihini nasıl eklerım acaba

tuncelhezer Purchased

about 7 years ago

turgut hocam bir sorun daha olucak, içerik çekilen site yeni yazı eklediği zaman o yazılarda eklenıyormu

turgutsaricam Author

about 7 years ago

Merhaba,

Hedef siteden herhangi bir veriyi kısa kod olarak kullanabilirsiniz. Kısa kod tanımlamak için “Yazı > Özel Kısa Kodlar > Kısa Kodlar için İçerik Seçiciler” seçeneğini kullanabilirsiniz. Tarih değerini bir kısa koda atadıktan sonra, “Şablonlar > Yazı Başlığı Şablonu” seçeneğini kullanarak başlığa ekleyebilirsiniz.

Eklenti hedef sitedeki kategori sayfalarını programlama seçeneklerinde belirtilen zaman aralıklarında kontrol eder. Eğer yeni bir yazı varsa, URL’sini sıraya ekler ve yazı kaydetme zamanı geldiğinde de yazıyı kaydeder.

marselpreci Purchased

about 7 years ago

Hi there I want to replace first 3 letters of an output. can you help me please

turgutsaricam Author

about 7 years ago

Hi,

You can do it by using find-replace options with regular expressions. Here is an example: https://regex101.com/r/QUlfXj/2 To use this, go to the find-replace option, check “regex” checkbox, write the regex into “find” input, and write ”$1” into “replace” input. Configuration of the settings is not covered by the support, unfortunately. You can find many tutorials about regular expressions on the web.

locnd88 Purchased

about 7 years ago

Hello again. Is that possible to remove link text in post source?

1 other reply

turgutsaricam Author

about 7 years ago

Hi,

Could you please explain what you mean by “remove link text”? Do you want to remove text of a link?

locnd88 Purchased

about 7 years ago

For example in this picture: https://drive.google.com/file/d/15AD2sAW2bTDwgHLOmil7BbNgE9177b6X/view?usp=sharing. I would like to remove all links from the text link but keep the text.

turgutsaricam Author

about 7 years ago

To remove links, you can check “Templates > Quick Fixes > Remove links from short codes” option.

locnd88 Purchased

about 7 years ago

Thank you for your help.

OmriS

about 7 years ago

Pre purchase question. Can I add a url after product url? So I want to take content from a site and add my affiliate url to the end of each url. Is this possible? I’ll probably have more questions but will start with this

turgutsaricam Author

about 7 years ago

Hi,

Yes, it is possible.

webadictos Purchased

about 7 years ago

Hi, i setup the plugin correctly and started to crawl the pages… but i see that in the first crawl doesnt save the featured imagen and the other images on the post.

If i go tools -> manual recrawling then it saves correctly….

My question is if you wonder why doesn’t save correctly the images the first time… or how can i manually recrawling all the posts ids with a cron job.

Best Regards,

turgutsaricam Author

about 7 years ago

Hi,

You might have configured the settings such that the plugin retrieves the featured images from the category pages as well as from the post pages. If your category settings are not correct, the plugin might have been matching the post URLs and featured images incorrectly. So, please check your category settings using the tester. If the featured images are not matched correctly, check “Category > Featured Image > Post links come before featured images” option and try again. Note that after you change the category options, you should invalidate already-collected URLs so that the plugin crawls them correctly this time. Otherwise, previously-collected post URLs will still be saved using an incorrect featured image. You can invalidate the URLs from “Tools > URLs > Clear URLs” tool.

If you want to manually recrawl posts, you can use “Tools > Manual Recrawling > Manually recrawl (update) a post” tool. You cannot manually crawl the posts with a CRON job since the plugin does not have a feature for that purpose. CRON jobs are used to do things automatically, not manually.

xthengu Purchased

about 7 years ago

Hi,

I bought your plugin but did not receive the license. It does not exist in the zip file. Please tell me where I can get it.

turgutsaricam Author

about 7 years ago

Hi,

Please see this FAQ: https://codecanyon.net/item/wp-content-crawler-get-content-from-almost-any-site-automatically/15983018/faqs/29730

sanayar2008

about 7 years ago

Hello

can the plugin grabbing from PHP websites ?

turgutsaricam Author

about 7 years ago

Hi,

Language used in the backend does not matter. The plugin can crawl any HTML code.

kdiscovery

about 7 years ago

hi,

i see the description page says your plugin is able to parsing Json. how to achieve this ? i did not find the Json option for css selector .

i ask this is because we plan to use your plugin to crawl a api json feeds.

waiting for your reply

turgutsaricam Author

about 7 years ago

Hi,

You can watch this to learn how to extract JSON values: https://www.youtube.com/watch?v=2MYWzq_oAig&t=727s

trueWD

about 7 years ago

Dostum işte alıcılara cevap vermemek için burda script yazmayı bıraktım. gerçekten sabır gerektiren bir iş :))

$29

Included: Quality checked by Envato
Included: Future updates
Included: 6 months support from turgutsaricam

Extend support to 12 months $21 $9

Quantity

Price is in US dollars and excludes tax and handling fees

This item was featured on CodeCanyon

Elite Author

turgutsaricam

View Portfolio

Last Update	7 September 2024
Gutenberg Optimized	Yes
Compatible Browsers	Firefox, Safari, Opera, Chrome, Edge
Compatible With	WooCommerce 8.x.x, WooCommerce 7.x.x
Files Included	JavaScript JS, CSS, PHP
Software Version	WordPress 6.5.x, WordPress 6.4.x, WordPress 6.3.x, WordPress 6.2.x, WordPress 6.1.x, WordPress 6.0.x, WordPress 5.9.x
Tags	ai content, autoblog, automatic, automatically, bot, chatgpt, content, post, product, programmatically, scheduling, scrape, scraper, woocommerce
Created	10 years ago

More Attributes Fewer Attributes

Discussion on WP Content Crawler - Get content from almost any site, automatically!

Discussion on WP Content Crawler - Get content from almost any site, automatically!

turgutsaricam supports this item

2655 comments found.

Hello

turgutsaricam

Tell us what you think!