RSS 2.0 Feed

» Welcome Guest Log In :: Register

Pages: (2) < [1] 2 >   
  Topic: The computer thread, Vile silicon machinery< Next Oldest | Next Newest >  
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Nov. 28 2008,05:16   

With the recent wget/curl discussion on the UD thread I figured a dedicated thread to computing woes might be in order.

--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
Henry J



Posts: 5786
Joined: Mar. 2005

(Permalink) Posted: Nov. 28 2008,11:03   

So how does one compute a woe, and what's the unit of measurement for those? :p

  
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Nov. 28 2008,11:24   

Quote (Henry J @ Nov. 28 2008,17:03)
So how does one compute a woe, and what's the unit of measurement for those? :p

I believe it's measured in Dells.

--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
BopDiddy



Posts: 71
Joined: Nov. 2007

(Permalink) Posted: Nov. 28 2008,15:24   

Nerds, start your engines, this may be what we're looking for:  ChangeDetect.

 
Quote

Know exactly what content has changed on your watched web pages...

ChangeDetect marks web page text for you with color-coded highlights of what has actually changed!

  
Arden Chatfield



Posts: 6657
Joined: Jan. 2006

(Permalink) Posted: Nov. 28 2008,20:18   

Quote (BopDiddy @ Nov. 28 2008,13:24)
Nerds, start your engines, this may be what we're looking for:  ChangeDetect.

   
Quote

Know exactly what content has changed on your watched web pages...

ChangeDetect marks web page text for you with color-coded highlights of what has actually changed!

Actually, I think this is what you guys really need.

--------------
"Rich is just mad because he thought all titties had fur on them until last week when a shorn transvestite ruined his childhood dreams by jumping out of a spider man cake and man boobing him in the face lips." - Erasmus

  
Henry J



Posts: 5786
Joined: Mar. 2005

(Permalink) Posted: Dec. 01 2008,13:58   

Quote (Arden Chatfield @ Nov. 28 2008,19:18)
Actually, I think this is what you guys really need.

Nah, I already have a supply of pocket protectors, from Staples. Don't need any more at the moment. :)

Henry

  
oldmanintheskydidntdoit



Posts: 4999
Joined: July 2006

(Permalink) Posted: Dec. 04 2008,08:15   

A useful firefox plugin is one that "saves a web page as an image". There are several different ones if you search.

Handy for those UD threads where you, for some reason, have several snapshots of it in differing stages of evolution.

Much easier then pasting individual screenshots together.

https://addons.mozilla.org/en-US/firefox/addon/3408
https://addons.mozilla.org/en-US/firefox/addon/1146
https://addons.mozilla.org/en-US....cat=all

--------------
I also mentioned that He'd have to give me a thorough explanation as to *why* I must "eat human babies".
FTK

if there are even critical flaws in Gauger’s work, the evo mat narrative cannot stand
Gordon Mullings

  
JLT



Posts: 740
Joined: Jan. 2008

(Permalink) Posted: Dec. 04 2008,10:10   

Quote (oldmanintheskydidntdoit @ Dec. 04 2008,14:15)
A useful firefox plugin is one that "saves a web page as an image". There are several different ones if you search.

Handy for those UD threads where you, for some reason, have several snapshots of it in differing stages of evolution.

Much easier then pasting individual screenshots together.

https://addons.mozilla.org/en-US/firefox/addon/3408
https://addons.mozilla.org/en-US/firefox/addon/1146
https://addons.mozilla.org/en-US....cat=all

I'm using ScrapBook. It has a lot of options, e.g. you can save parts of a page or save the page including linked graphics/figures (useful for html versions of articles) or ALL linked content (that can take some time to archive), or simply save/archive the page at it is.

A nice feature is that you can edit the page after you archived it e. g. you can remove ads, highlight parts of it, or add notes.

--------------
"Random mutations, if they are truly random, will affect, and potentially damage, any aspect of the organism, [...]
Thus, a realistic [computer] simulation [of evolution] would allow the program, OS, and hardware to be affected in a random fashion." GilDodgen, Frilly shirt owner

  
Reciprocating Bill



Posts: 4265
Joined: Oct. 2006

(Permalink) Posted: Dec. 04 2008,10:23   

I've played around quite a bit with an OS X utility called "FileVacuum," which is essentially wget wrapped in an OS X interface. I've concluded that there are both practical and ethical (but not absolute) obstacles to tracking UD with sufficient coverage to reliably capture obliviated threads.

One reliable way of doing this would be to capture successive snapshots of UD with sufficient frequency to catch most changes. Previous copies need be renamed/grandfathered and retained. When obliviation is detected we either refer to our most recent copy, or, if that copy reflects the obliviation, go back a few hours and consult an earlier snapshot.

The obvious problem with this is that capturing the entire site, which is over 300 MB and thousands of files, takes hours even on a relatively fast cable connection and probably imposes a burden on their bandwidth that I find ethically unacceptable.

Alternatively, after having captured a snapshot of UD in its entirety once, I envision using wget to detect and download only new UD files (passing over files for which copies exist locally) relative to copies of the original archive. That newly updated copy is stored and never updated further. That is, the process would be repeated, say, 2x daily, but again against the same stored, unchanged copy of an original archive, relative to which new threads (relative to the original archive) that have been updated since the original archive was grabbed would again be "new," and hence copied locally. Manually grabbing new copies of the indexes for the current month may also be desirable (since such indexes would be a "new file" just once, even as references to new threads are added, so repeated updates help). This process would catch early copies of each thread, and repeating the process against a never modified stored archive of the entire site would result in updates that include comments added to threads since the local archive was first created. These successive, updated copies of UD are retained an consulted when needed.  

However, I found that even the process of using wget to compare local and server copies of their site and downloading only new files, while using much less bandwidth, is very time consuming - again taking several hours. That repeated interrogation of UD - which requires that every file on their server be checked against the local copy (with no transfer when a local copy is found) - still must be very demanding upon their bandwidth and disk activity, and I wonder if that doesn't go over the line ethically.

However, I don't know enough about the resources that process demands to make that judgement. Others may also have ethical lines drawn at different locations, so I'd be interested in your thoughts on either topic (or other approaches that did not occur to me.)

Really, the whole idea seems like more trouble than it is worth.

--------------
Myth: Something that never was true, and always will be.

"The truth will set you free. But not until it is finished with you."
- David Foster Wallace

"Here’s a clue. Snarky banalities are not a substitute for saying something intelligent. Write that down."
- Barry Arrington

  
Erasmus, FCD



Posts: 6349
Joined: June 2007

(Permalink) Posted: Dec. 04 2008,13:15   

sounds like design detection to me!

--------------
You're obviously illiterate as hell. Peach, bro.-FtK

Finding something hard to believe based on the evidence, is science.-JoeG

the odds of getting some loathsome taint are low-- Gordon E Mullings Manjack Heights Montserrat

I work on molecular systems with pathway charts and such.-Giggles

  
Nerull



Posts: 317
Joined: June 2007

(Permalink) Posted: Dec. 04 2008,14:02   

What I would do is:

1. Download only the html - images and such aren't required.

2. Track only the 'n' most recent threads, to save bandwidth and time.

3. Feed the html for each post and comment thread into a subversion repository, which will track changes. Possibly after being fed through a script to strip out everything except the post and comments (Ads, sidebar stuff, etc.), and detect 404s.

What you could also do is store posts and comments in a database by comment id, so that when a comment goes missing it can be pulled from the database with a simple query. This wouldn't track changes without some more work, but it would save deleted comments and posts.

--------------
To rebut creationism you pretty much have to be a biologist, chemist, geologist, philosopher, lawyer and historian all rolled into one. While to advocate creationism, you just have to be an idiot. -- tommorris

   
Reciprocating Bill



Posts: 4265
Joined: Oct. 2006

(Permalink) Posted: Dec. 04 2008,14:27   

Quote (Nerull @ Dec. 04 2008,15:02)
What I would do is:

1. Download only the html - images and such aren't required.

2. Track only the 'n' most recent threads, to save bandwidth and time.

3. Feed the html for each post and comment thread into a subversion repository, which will track changes. Possibly after being fed through a script to strip out everything except the post and comments (Ads, sidebar stuff, etc.), and detect 404s.

What you could also do is store posts and comments in a database by comment id, so that when a comment goes missing it can be pulled from the database with a simple query. This wouldn't track changes without some more work, but it would save deleted comments and posts.

The organization of the UD directory structure makes a lot of this difficult, or at least difficult to script. Each thread gets it's own directory containing an index.html file that contains both the original post and all the comments. But those are scattered throughout dozens of directories named by topic ("Intelligent Design," "Politics" etc.) There are hundreds of threads and dozens of topics. A real mess.

--------------
Myth: Something that never was true, and always will be.

"The truth will set you free. But not until it is finished with you."
- David Foster Wallace

"Here’s a clue. Snarky banalities are not a substitute for saying something intelligent. Write that down."
- Barry Arrington

  
Wesley R. Elsberry



Posts: 4991
Joined: May 2002

(Permalink) Posted: Dec. 04 2008,14:36   

The back-up system that the UD folks got so upset about simply archived the RSS feed.

--------------
"You can't teach an old dogma new tricks." - Dorothy Parker

    
Reciprocating Bill



Posts: 4265
Joined: Oct. 2006

(Permalink) Posted: Dec. 04 2008,15:06   

Quote (Wesley R. Elsberry @ Dec. 04 2008,15:36)
The back-up system that the UD folks got so upset about simply archived the RSS feed.

Were they upset b/c they were being archived, or b/c the archive was publicly available on the intertubes (assuming it was)?

--------------
Myth: Something that never was true, and always will be.

"The truth will set you free. But not until it is finished with you."
- David Foster Wallace

"Here’s a clue. Snarky banalities are not a substitute for saying something intelligent. Write that down."
- Barry Arrington

  
Nerull



Posts: 317
Joined: June 2007

(Permalink) Posted: Dec. 04 2008,15:35   

Well, they aren't actually in a directory, that's just mod_rewrite at work.

The posts are actually stored in a database by Wordpress, normally accessable with the format www.whatever.com/?p=<postid>

mod_rewrite rules can change things to make it more human friendly, which is what UD does. But the old style still works.

For example, http://www.uncommondescent.com/?p=3000 is DaveScot asking how the bigfoot shaped rock on mars is explainable by evolution.

--------------
To rebut creationism you pretty much have to be a biologist, chemist, geologist, philosopher, lawyer and historian all rolled into one. While to advocate creationism, you just have to be an idiot. -- tommorris

   
Reciprocating Bill



Posts: 4265
Joined: Oct. 2006

(Permalink) Posted: Dec. 04 2008,15:41   

Quote (Nerull @ Dec. 04 2008,16:35)
Well, they aren't actually in a directory, that's just mod_rewrite at work.

The posts are actually stored in a database by Wordpress, normally accessable with the format www.whatever.com/?p=<postid>

mod_rewrite rules can change things to make it more human friendly, which is what UD does. But the old style still works.

For example, http://www.uncommondescent.com/?p=3000 is DaveScot asking how the bigfoot shaped rock on mars is explainable by evolution.

Mmmm.

Increasingly, a good nap is more tempting then extensive squints at WP's data structures for the sole purpose of tweaking UD's collective nose.

--------------
Myth: Something that never was true, and always will be.

"The truth will set you free. But not until it is finished with you."
- David Foster Wallace

"Here’s a clue. Snarky banalities are not a substitute for saying something intelligent. Write that down."
- Barry Arrington

  
Nerull



Posts: 317
Joined: June 2007

(Permalink) Posted: Dec. 04 2008,15:43   

One problem I've found simply going by the numbers is that every post seems to have 50 different revisions, which of which has a different number. Unfortunately they are blank, so you can't use that to see how things have been edited.

--------------
To rebut creationism you pretty much have to be a biologist, chemist, geologist, philosopher, lawyer and historian all rolled into one. While to advocate creationism, you just have to be an idiot. -- tommorris

   
Nerull



Posts: 317
Joined: June 2007

(Permalink) Posted: Dec. 04 2008,15:49   

I'd tend to agree that its a lot of work for not much purpose, which is why I haven't tried it.

--------------
To rebut creationism you pretty much have to be a biologist, chemist, geologist, philosopher, lawyer and historian all rolled into one. While to advocate creationism, you just have to be an idiot. -- tommorris

   
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Dec. 05 2008,12:23   

One way of doing it would be to subscribe to the comment feed that exists for each post.

Just add /feed to the url of the post.

--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Jan. 09 2009,19:26   

Here's a yahoo pipes thing you can use to continually get all comments from at least the most current UD threads in a single rss.

http://pipes.yahoo.com/pipes....der=rss

Simple to do and I figured it might be useful to keep track of memory holed comments.

--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
stevestory



Posts: 13407
Joined: Oct. 2005

(Permalink) Posted: Jan. 24 2009,21:33   

Quote (dnmlthr @ Jan. 09 2009,20:26)
Here's a yahoo pipes thing you can use to continually get all comments from at least the most current UD threads in a single rss.

http://pipes.yahoo.com/pipes....der=rss

Simple to do and I figured it might be useful to keep track of memory holed comments.

Oooohhhh....that's going in the bookmarks. Until UD figures out how to shut it down.

   
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Mar. 06 2009,18:16   

By slapping together a simple markov chain text generator and web scraper I've built an UD of my own.

Right now the corpus just consists of the hundred or so latest comments, but I'm working on a way to further automate that process.

Would you call Poe on these comments?

Quote
All of this nature are quite useless in the universe, even if they take a strong stance against social and moral relativism.


Quote
So, processor behaviour as that of a processor and its accompanying flavor of skepticism (not forgetting that is evolutionarily-related to plastids, see if it has a long way to go and as mutations and variation made the patch more concave, the perceieved direction of insisting that evolution started somewhere in the debate of the supernatural.


Quote
Anyone who does not have been widely understood for Hegel to make a point of contention on both sides; negated often by the pseudoscience of creation science since they tend to agree with Pournelle on one point: the biggest danger today is the way they would actually learn what is metaphysically possible.


--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
oldmanintheskydidntdoit



Posts: 4999
Joined: July 2006

(Permalink) Posted: Mar. 06 2009,18:31   

Quote (dnmlthr @ Mar. 06 2009,18:16)
Would you call Poe on these comments?

Reminds me of Kairosfocus.

--------------
I also mentioned that He'd have to give me a thorough explanation as to *why* I must "eat human babies".
FTK

if there are even critical flaws in Gauger’s work, the evo mat narrative cannot stand
Gordon Mullings

  
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Mar. 06 2009,18:32   

This is way too much fun.

Quote
(At least, I'm assuming it was intelligently designed until we learned EXACTLY how it was he who stands firm to the inference that most people continue to distruss such assertions and fairy tales for 150 years of the universe.


Quote
In this case the feeling was not unlike the experience of pleasure), but not necessarily vice-versa.


And my personal favourite:
Quote
It's like if we are using terms in quite common and standard ways.


--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Mar. 07 2009,16:23   

This is the last round of markov-generated UD-isms, promise.

Quote
It's an interview, Denyse, not an intelligent agent.

Quote
They go so far as actual program code goes, I am convinced of its anti-intellectual orientation.

Quote
Does heat have some way of examples and Webster definitions.

Quote
For example, Dawkins is fond of talking about fairies and goblins - (clearly mythical beings that can output binary strings.

Quote
But that is shiny and hairless thus somehow matching someones definition of pseudo-science or anti-science.


--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
Erasmus, FCD



Posts: 6349
Joined: June 2007

(Permalink) Posted: Mar. 07 2009,16:43   

that stuff is great.  could you find a place to put them somewhere on the internetz where they might be useful to certain woolen friends of ours?  i can see lots of unmarried moms basement dwellers nodding and harrumping in agreement to these comments.

--------------
You're obviously illiterate as hell. Peach, bro.-FtK

Finding something hard to believe based on the evidence, is science.-JoeG

the odds of getting some loathsome taint are low-- Gordon E Mullings Manjack Heights Montserrat

I work on molecular systems with pathway charts and such.-Giggles

  
dnmlthr



Posts: 565
Joined: Mar. 2008

(Permalink) Posted: Mar. 07 2009,16:58   

Erasmus: It's interesting to see how fast the output improves with a larger corpus to draw from.

The programs are really rough around the edges right now, but cleaning them up should be pretty easy to do once I'm through with the midterms.

--------------
Guess what? I don't give a flying f*ck how "science works" - Ftk

  
midwifetoad



Posts: 4003
Joined: Mar. 2008

(Permalink) Posted: Mar. 07 2009,17:37   

One of my internet buds once engaged a creationist with unedited responses from Eliza. It went on for an hour or so of back and forth.

--------------
Any version of ID consistent with all the evidence is indistinguishable from evolution.

  
AmandaHuginKiss



Posts: 150
Joined: Dec. 2008

(Permalink) Posted: Mar. 07 2009,19:58   

We used to call that text generator Shakespeare as that what was used in the SciAm article we pinched it from (Showing my age again).
We once wrote a summary and a conclusion to a report and then fed this into shakespeare to produce the body of the report. Nobody noticed.

You are probably more up on this than I am but if you feed enough raw material into the program and expand the search string long enough, you would create a great sockpuppet (You could have two stores, one with the entire opus and another with the current thread)

  
Lou FCD



Posts: 5455
Joined: Jan. 2006

(Permalink) Posted: Mar. 08 2009,10:04   

I spent several hours yesterday working on my Pop's machine instead of studying for Tuesday's Bio 112 exam or writing my paper on Frost.

It's an 800mhz hand-me-down with 64 Megs of SD ram and Windows ME (no service packs) on it.

I got it upgraded (finally) to XP home and put Firefox on it. He liked that much better than IE5, but prefers AOL 7.0.

Just kill me, please, before he calls again.

--------------
“Why do creationists have such a hard time with commas?

Linky“. ~ Steve Story, Legend

   
  45 replies since Nov. 28 2008,05:16 < Next Oldest | Next Newest >  

Pages: (2) < [1] 2 >   


Track this topic Email this topic Print this topic

[ Read the Board Rules ] | [Useful Links] | [Evolving Designs]