Add more sites to the scrapy start_urls #6

groovecoder · 2016-03-13T18:28:04Z

With just nextcity.org and govtech.com, I was able to crawl and index 18.8k articles/pages. @chimchim237 knows the bigger list of real sites to crawl.

chimchim237 · 2016-03-23T21:22:36Z

@groovecoder i made a REALLY good list. ...and I thought I emailed it to you? but now i can't find it. :(

groovecoder · 2016-03-24T18:16:42Z

Yeah, I thought I saw that somewhere too ... but I can't find it in email. 😢 Re-creating on GH issue will keep it public and permanent at least.

chimchim237 · 2016-05-05T18:16:04Z

nextcity.org
strongtowns.org
citylab.com
iqc.ou.edu
urbanland.uli.org
planetizen.com
streetsblog.net
governing.com

cnt.org/blog
thehappycity.com/blog/
smartgrowthamerica.org/blog
smartgrowthtulsa.com/blog
100resilientcities.org/blog
sunlightfoundation.com/blog
brookings.edu/about/programs/metro/research

chimchim237 · 2016-05-05T18:17:29Z

I found the file to edit, and can easily do that if you want me to, this weekend. Lemme know.

chimchim237 · 2017-03-24T15:04:23Z

found another great one.
http://na.smartcitiescouncil.com/
http://cc.smartcitiescouncil.com/

chimchim237 · 2017-06-22T23:30:17Z

Another one:
http://datasmart.ash.harvard.edu/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more sites to the scrapy start_urls #6

Add more sites to the scrapy start_urls #6

groovecoder commented Mar 13, 2016

chimchim237 commented Mar 23, 2016

groovecoder commented Mar 24, 2016

chimchim237 commented May 5, 2016 •

edited

Loading

chimchim237 commented May 5, 2016

chimchim237 commented Mar 24, 2017 •

edited

Loading

chimchim237 commented Jun 22, 2017

Add more sites to the scrapy start_urls #6

Add more sites to the scrapy start_urls #6

Comments

groovecoder commented Mar 13, 2016

chimchim237 commented Mar 23, 2016

groovecoder commented Mar 24, 2016

chimchim237 commented May 5, 2016 • edited Loading

chimchim237 commented May 5, 2016

chimchim237 commented Mar 24, 2017 • edited Loading

chimchim237 commented Jun 22, 2017

chimchim237 commented May 5, 2016 •

edited

Loading

chimchim237 commented Mar 24, 2017 •

edited

Loading