SEARCH KEYWORD -- Free book



  Cleansing data with Pig and storing JSON format to HBase with Pig UDF

Introduction This post will explain you the way to clean data and store JSON format to HBase. Hadoop architect experts also explain Apache Pig and its advantages in Hadoop in this post. Read more and find out how they do it. This post contains steps to do some basic clean the duplication data and convert the data to JSON format to store to HBase. Actually, we have some built-in lib to parse JSON in Pig but it is important to manipulate the JSON data in Java code before store to HBase. Apache Pig...

   JSON,HADOOP ARCHITECT,APACHE HBASE,PIG UDF     2016-06-10 01:13:41

  Google has done more for the world with ngrams

Data is valuable asset for a company in the Internet world. With data of users, a company can gain lots of benefits. They can push specified ads to users by analyzing user behaviors, they can even sell the data to third parties. Data is very important for a company's success, so some companies will keep their data secret in order to gain advantages over competitors. However, Google seems do it in another way. Google shared their ngrams text corpus publicly, which basically contains valuable info...

   Ngram,NLP,Data     2013-12-12 07:56:02

  Announcing New Feed Service

We’re thrilled to announce that Feed is now live! Feed brings you the hottest tech stories daily, covering trending topics like AI, LLMs, Rust, Python, and more. What is Feed? Feed is your personalized tech news curator. It uses AI to gather and filter the most relevant stories from around the world, ensuring you stay informed without being overwhelmed. Whether it’s a breakthrough in AI or a new feature in Rust, Feed delivers just the right amount of information. How Does It Work? F...

   AI,CHINESE,TECH FEED     2024-12-12 20:04:59

  How to be jQuery-free?

jQuery is now the most famous JavaScript library. There are around 57.3% websites in the world using jQuery, i.e, 6 out of 10 websites are using jQuery. If we only consider those websites which use libraries, then the percentage is even higher which is 91.7%. Although jQuery is very popular, its size is still a headache to many websites maintainers. The uncompressed jQuery 2.0 has a size of 235KB, the size is 81KB after optimization.The jQuery 1.8.3 which supports IE 6/7/8 has a uncompressed si...

   jQuery,JavaScript,ECMAScript,CSS3     2013-05-13 11:53:20

  Everything you need to know about what happened at Apple’s Special Event 2019

Apple’s special event that was announced in mid-November is finally over. The company held a presentation event in New York City where it highlighted the best apps and games of 2019 with no new software or hardware announcements at all.   On the event, five different nominations were announced: Apple’s chosen favourite apps, top free apps, top paid apps, top free games, and top paid games for 2019. Usual keynotes from Apple The tech industry is pretty accustomed to Apple’s...

   APPLE,EVENTS     2019-12-04 05:50:15

  Google+ is sick

Google says that Google+ now has over 170million registered users, but according a new research report of Fast Company about Google+,  Google+ is just a virtual ghost town.This research selected a sample of 40,000 random Google+ users. The result is :The average post on Google+ has less than one +1, less than one reply, and less than one re-shareRoughly 30% of users who make a public post never make a second oneEven after making five public posts, there is a 15% chance that a user will not ...

   Google+,Sick,Research     2012-05-16 05:41:35

  China has blocked Wikipedia again

Recently lots of Chinese found they experienced problems while trying to access Wikipedia, the world's largest and most well-known free encyclopedia. It turns out that the notorious Chinese GFW(Great Firewall) is playing the critical role here. It blocks the Chinese version of Wikipedia. The reason why the Chinese version of Wikipedia is blocked is said to be that Wikipedia has changed its default protocol from HTTP to HTTPS. This change made it difficult for the GFW to filter some sensiti...

   CHINA,WIKIPEDIA,GFW,BLOCKED,NEWS     2015-12-05 19:04:12

  HeartBleed: OpenBSD now starts to clean up OpenSSL

Since the disclosure of HeartBleed bug in OpenSSL, some arguments emerge around the safety of OpenSSL, the largest open source SSL/TLS library used by large number of servers, applications. Some people are even starting to create their own version of SSL library. This includes OpenBSD, a famous Unix like open source operating system. Just a few days after the HeartBleed bug, OpenBSD forked a new branch of OpenSSL and started to clean up the forked branch and plans to merge it to its own cod...

   HeartBleed,OpenBSD     2014-04-15 04:21:26

  Google Drive will support offline editing soon

Normal 0 7.8 pt 0 2 false false false EN-US ZH-CN X-NONE    Google Drive,Offline editing,Cloud computing     2012-06-04 06:42:47

  Use Java ThreadLocal with caution

According to Oracle documentation, ThreadLocal is a class provides thread-local variables. These variables differ from their normal counterparts in that each thread that accesses one (via its get or set method) has its own, independently initialized copy of the variable. ThreadLocal instances are typically private static fields in classes that wish to associate state with a thread. In short, ThreadLocal variables are variables belong to a thread, not a class or an instance of a class. One common...

   JAVA,MEMORY LEAK, THREADLOCAL     2015-11-03 07:31:57