java - Accessing files from other filesystems along with HDFS files in a Hadoop MapReduce application


I know that we can write a MapReduce job as a normal Java application. In my case, the job needs to work with files on HDFS as well as files on other filesystems. Is it possible to use files from other filesystems while simultaneously using files on HDFS?

So basically my intention is this: I have a large file that I want to put in HDFS for parallel computation, and I then want to compare the blocks of this file against some other files, which I do not want to put in HDFS because each of them needs to be read as a single, complete file.

You can use files from the local file system in your job. To distribute them, add them to the DistributedCache, then open and read each file in the configure() method (do not read them in map(), because map() is called for every record and the file would be read many times).

Edit

To let your map/reduce tasks access files from the local file system, add those files to the DistributedCache when you set up your job configuration:

  JobConf job = new JobConf();
  DistributedCache.addCacheFile(new URI("/myapp/lookup.dat#lookup.dat"), job);

The MapReduce framework will ensure that those files are accessible to your mappers:

  public void configure(JobConf job) {
      // retrieve the locally cached files
      Path[] localFiles = DistributedCache.getLocalCacheFiles(job);
      // open, read and store their contents for use in the map phase
  }

And delete the files when you are done.
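The "open, read and store" step inside configure() is plain Java I/O, independent of Hadoop. As a minimal sketch, assuming the cached lookup.dat contains tab-separated key/value pairs (that format is an assumption for illustration, not stated in the original post), loading it into an in-memory map might look like:

```java
import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

public class LookupLoader {
    // Load a tab-separated lookup file (e.g. the cached lookup.dat)
    // into a HashMap, so map() can do fast in-memory lookups instead
    // of re-reading the file for every input record.
    public static Map<String, String> load(String path) throws IOException {
        Map<String, String> lookup = new HashMap<>();
        try (BufferedReader reader = new BufferedReader(new FileReader(path))) {
            String line;
            while ((line = reader.readLine()) != null) {
                // split into at most two fields: key and value
                String[] parts = line.split("\t", 2);
                if (parts.length == 2) {
                    lookup.put(parts[0], parts[1]);
                }
            }
        }
        return lookup;
    }
}
```

You would call LookupLoader.load("lookup.dat") once from configure(), keep the returned map in a field of the mapper class, and consult it from map().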
