hey
After the Microsoft update bug, I think many people's eyes should be open to the fragility of the internet. I think the sysadmins should look into making the site available offline, because things will get worse. They should talk to Kiwix about making an offline version of the site. I am sure they would assist and host it for you.
https://kiwix.org/en/
Look at what they have available. OpenGameArt.org could be on the list.
https://library.kiwix.org/#lang=eng
I am a little busy right now working on the Debian DVD. I am porting all the apps from web to binaries and making new apps for the King James Bible. It's a lot of work. I will not get a chance to work on a web scraper for your site right now. I also need to get some additional storage, since I do not know the size of your site, and I don't have the cash for that at the moment. I guess by the time I get the external storage, the sysadmins might have figured out a solution to this problem. One final thing: this site should be preserved on the Wayback Machine because of its uniqueness and its important role in the open source community. It is a valuable treasure.
I might not do the web scraper. The option of taking a backup of the site offline and removing or disabling the user accounts could work. It is up to you.
Take a look at survivorlibrary. I have all the books offline as PDFs, organized by category. I can download the HTML of each category page, extract the PDF links, and download the files into the relevant folder. The HTML pages have a small footprint. I do this every 2 weeks to see if he has put up anything new. If your site were organized like this, I could build such a tool for your site.
I still think having an offline version of your site would be wonderful, because you could upload it to archive.org and people could go there to get the downloads. That way your main site is not affected. It is up to you what you decide. I will not scrape your website because it is not set up like survivorlibrary.
https://www.survivorlibrary.com/index.php/library-download
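To illustrate the link-extraction step described above, here is a minimal Python sketch (not the actual tool, which is a separate project). It assumes a category page is plain HTML with ordinary `<a href="...pdf">` links; the sample page, the `example.org` URLs, and the names `PdfLinkParser` and `extract_pdf_links` are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkParser(HTMLParser):
    """Collect the href of every <a> tag that points at a .pdf file."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and href.lower().endswith(".pdf"):
            # Resolve relative links against the page they came from.
            self.links.append(urljoin(self.base_url, href))


def extract_pdf_links(page_html, base_url):
    """Return absolute URLs of all PDF links found in page_html."""
    parser = PdfLinkParser(base_url)
    parser.feed(page_html)
    return parser.links


# Hypothetical category page, invented for this example.
sample_page = """
<html><body>
<a href="/library/farming_basics.pdf">Farming Basics</a>
<a href="about.html">About</a>
<a href="https://example.org/files/Well_Digging.PDF">Well Digging</a>
</body></html>
"""

pdf_links = extract_pdf_links(sample_page, "https://example.org/category/farming")
for link in pdf_links:
    print(link)
```

Only the small HTML page is fetched per category, so the server sees one light request before any file download is decided.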
Sounds like a plan. We have to see if the site owner is OK with that. Your user name is that of a major Old Testament prophet. Cool name.
I also feel that your site should be preserved on the Wayback Machine because it is an important resource.
Hi. I think I might have found a solution for you. You can try this. If it works, you can have an exact copy of your site without the logins. That means you can zip it up and upload it to a site like archive.org. I host my custom Linux ISOs there. That could ease the burden on your server. I would set up a Linux VM to run Drupal, make a backup copy of the site, and migrate it into the virtual machine. There you can do all your experimenting, just like the guy in the article, without messing with your main site. This might be the solution, and it will retain your website branding so everyone will know it came from you.
https://drupal.stackexchange.com/questions/109156/duplicating-drupal-sit...
I will try to make a bash script to see if I can do it in a minimal manner. The tools I make are for Linux only. I do not use Windows anymore because of security issues.
Yes. That is what I said in one of my earlier comments. I need the license file (txt) along with the content (mp3, mid, zip, etc.). Maybe you could generate the HTML files per user account that uploaded the content. That way you could potentially preserve everything in place.
Hi. Let me explain who I am. I consider myself a person who believes in preserving data for the bad times to come. I believe the internet is going to be taken down. It's only a matter of time with the evil leaders we have in this world. Preserving information so others can access it is very important. This is the reason I am making a tool (using the Lazarus IDE) to download PDF files from survivorlibrary while showing respect to the owner of the site. Your site is complicated because it is a CMS. I am assuming you have the data files (mp3, zip, etc.) on the physical disk. If you are using Linux, you could generate HTML files with links to the files on the file system, and I could download those HTML files. This way I could check the HTML files, see which files I already have, and download only the ones I do not have. HTML files should not be a hassle for the servers. It should be simple.
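As a rough idea of what generating such an HTML file could look like, here is a minimal Python sketch. It assumes the data files sit in one directory tree on disk and are served under a single base URL; how the site actually stores its files is not known, and `build_index`, the directory path, and the `example.org` URL are hypothetical.

```python
import html
import os
from urllib.parse import quote


def build_index(root_dir, base_url):
    """Return an HTML page with one link per file found under root_dir."""
    rows = []
    for folder, _subdirs, files in os.walk(root_dir):
        for name in files:
            rel_path = os.path.relpath(os.path.join(folder, name), root_dir)
            # Use forward slashes and percent-encode so the link is a valid URL.
            url_path = quote(rel_path.replace(os.sep, "/"))
            rows.append(
                '<li><a href="{}/{}">{}</a></li>'.format(
                    base_url.rstrip("/"), url_path, html.escape(rel_path)
                )
            )
    rows.sort()  # stable order, so two index files can be compared easily
    return "<html><body><ul>\n" + "\n".join(rows) + "\n</ul></body></html>\n"


# Example call (paths and URL are placeholders):
# page = build_index("/srv/site/files", "https://example.org/files")
# with open("index.html", "w", encoding="utf-8") as f:
#     f.write(page)
```

The output is a static file, so it could be regenerated on a schedule and served without touching the CMS.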
Hi. I know about HTTrack. I do not want to disrespect the website owner by hitting his servers with so many requests. This is the reason I asked if he has an offline option that I can download. I do not need the site contents. I just want the files along with the necessary license file (CC0, CC-BY, etc.). That way I could always search through the file system to find what I want. I am building a web scraper for survivorlibrary because that site is straightforward to understand, and I use a method that checks all the files against what I have on disk. This way I only download what is new, which tends to be very small. This tool will be in the next release of the DVD. The problem with the OpenGameArt site is that it is a CMS, and that creates a problem. I have to wait and see what happens. At the least, I want to respect the owner.
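The "check against what I have on disk" idea can be sketched in a few lines of Python. This is only an illustration, not the tool itself: it assumes each file is saved under the name taken from the last part of its URL and that one folder holds one category. The function names and URLs are made up.

```python
import os
import posixpath
from urllib.parse import unquote, urlparse


def filename_from_url(url):
    """Return the decoded file name at the end of a URL path."""
    return posixpath.basename(unquote(urlparse(url).path))


def links_not_on_disk(links, folder):
    """Return only the links whose file is not already present in folder."""
    have = set(os.listdir(folder)) if os.path.isdir(folder) else set()
    return [url for url in links if filename_from_url(url) not in have]


# Example call (placeholder URLs and folder):
# todo = links_not_on_disk(
#     ["https://example.org/lib/book_a.pdf", "https://example.org/lib/book_b.pdf"],
#     "library/farming",
# )
```

Comparing by file name alone will miss files that were replaced under the same name; comparing sizes or checksums would catch that, at the cost of more work.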