ruby - getaddrinfo error with Mechanize
I wrote a script that goes through every record in our database, verifies that the record's website URL works, and looks for a Twitter link on its homepage. We have a little over 10,000 URLs to verify. After a fraction of the URLs have been checked, we start getting getaddrinfo errors for every remaining URL.
Here is a copy of the code that scrapes a single URL:
```ruby
def scrape_url(url)
  url_found = false
  twitter_name = nil

  begin
    agent = Mechanize.new do |a|
      a.follow_meta_refresh = true
    end
    agent.get(normalize_url(url)) do |page|
      url_found = true
      twitter_name = find_twitter_name(page)
    end
    @err << "[#{@current_record}] SUCCESS\n"
  rescue Exception => e
    @err << "[#{@current_record}] ERROR (#{url}): "
    @err << e.message
    @err << "\n"
  end
end
```

Note: I have also run a version of this code that creates a single Mechanize instance shared across all calls to scrape_url. It failed in exactly the same way. When I run the script on EC2, it gets through about 1,000 URLs, then returns this error for the remaining 9,000+:
```
getaddrinfo: Temporary failure in name resolution
```

Note that I tried using both Amazon's DNS servers and Google's DNS servers, thinking it might be a legitimate DNS problem. I got the same results in both cases.
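One way to rule out one-off DNS hiccups while diagnosing is to wrap each scrape in a bounded retry on `SocketError`. This is a minimal stdlib-only sketch, not part of the original script; the helper name, retry count, and wait interval are all illustrative assumptions:

```ruby
# Retry the given block a few times when name resolution fails transiently.
# Re-raises immediately for non-getaddrinfo errors, and after the last attempt.
# max_attempts and wait are arbitrary illustrative defaults.
def with_dns_retries(max_attempts: 3, wait: 0.5)
  attempts = 0
  begin
    yield
  rescue SocketError => e
    attempts += 1
    raise unless e.message.include?("getaddrinfo") && attempts < max_attempts
    sleep(wait)
    retry
  end
end
```

Usage would look like `with_dns_retries { agent.get(normalize_url(url)) }`. If the errors persist across retries, the failure is systemic rather than transient, which points at the resource-exhaustion explanation given in the answer.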
Then I tried running it on my local MacBook Pro. It only got through about 250 records before returning this error for the rest:
```
getaddrinfo: nodename nor servname provided, or not known
```

Does anyone know how I can get the script through all the records?

Answer:

I found the solution. Mechanize was leaving connections open and relying on GC to clean them up. After a certain point, there were enough open connections that no additional outbound connection could be established to do a DNS lookup. Here's the code that made it work:
```ruby
agent = Mechanize.new do |a|
  a.follow_meta_refresh = true
  a.keep_alive = false
end
```

By setting keep_alive to false, each connection is closed and cleaned up immediately instead of being left open.
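The diagnosis can be checked directly: each leaked keep-alive connection holds one file descriptor, and once the process's soft descriptor limit is reached, even the UDP socket needed for a DNS lookup cannot be opened. A stdlib-only sketch for MRI (the helper name `fd_headroom` is hypothetical, and counting `IO` objects via ObjectSpace is an approximation):

```ruby
# Estimate how close the current process is to its open-file limit.
# Process.getrlimit(:NOFILE) returns [soft, hard] descriptor limits;
# live, unclosed IO objects approximate the descriptors in use (MRI only).
def fd_headroom
  soft, _hard = Process.getrlimit(:NOFILE)
  open_count = ObjectSpace.each_object(IO).reject(&:closed?).count
  { limit: soft, open: open_count, remaining: soft - open_count }
end
```

Logging `fd_headroom` every few hundred URLs would show `remaining` steadily shrinking with keep-alive connections leaking, and holding steady once `keep_alive = false` is set.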