Truth Discovery in Big Data Social Media Sensing Applications
Omar Ahmed1, Sangeeta Gupta2, Mohammed Hasibuddin3

1Omar Ahmed*, Computer Science Engineering Student, Chaitanya Bharathi Institute of Technology, Hyderabad, India.
2Dr Sangeeta Gupta, Associate Professor in Computer Science Engineering Department, Chaitanya Bharathi Institute of Technology, Hyderabad, India.
3Mohammed Hasibuddin, Computer Science Engineering Student, Chaitanya Bharathi Institute of Technology, Hyderabad, India.
Manuscript received on May 16, 2020. | Revised Manuscript received on June 01, 2020. | Manuscript published on June 10, 2020. | PP: 999-1004 | Volume-9 Issue-8, June 2020. | Retrieval Number: H6311069820/2020©BEIESP | DOI: 10.35940/ijitee.H6311.069820
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (

Abstract: The detection of truthful information amid data provided by online social media platforms (e.g., Twitter, Facebook, Instagram) is a critical task in the trend of big data. Truth Discovery is nothing but the extraction of true information or facts from unwanted and raw data, which has become a difficult task nowadays in today’s day and age due to the rampant spread of rumors and false information. Before posting anything on the social media platform, people do not consider fact-checking and the source authenticity and frantically spread them by re-posting them which has made the detection of truthful claims more difficult than ever. So, this problem needs to be addressed soon since the impact of false information and misunderstanding can be very powerful and misleading. This mission, truth discovery, is targeted at establishing the authenticity of the sources and therefore the truthfulness of the statements that they create without knowing whether it is true or not. We propose a Big Data Truth Discovery Scheme (BDTD) to overcome the major problems. We have three major problems, the main one being “False information spread” where a large number of sources lead to false or fake statements, making it difficult to distinguish true statements, now this problem is solved by our scheme by studying the various behaviors of sources. On Twitter for example rumormongering is common. The second problem is “lack of claims” where most outlets contribute only a tiny small number of claims, giving very few pieces of evidence and making it not sufficient to analyze the trustworthiness of such sources, this problem is addressed by our scheme where it uses an algorithm that evaluates the claim’s truthfulness and historic contributions of the source regarding the claim. Thirdly the scalability challenge, due to the clustered design of their existing truth discovery algorithms, many existing approaches don’t apply to Big-scale social media sensing cases so this challenge is managed by our scheme by making use of frameworks HT Condor and Work Queue. This scheme computes both the reliability of the sources and, ultimately, the legitimacy of statements using a novel approach. A distributed structure is also developed for the implementation of the proposed scheme by making use of the Work Queue (platform) in the HT Condor method (maybe distributed). Findings of the test on a real-world dataset indicate that the BDTD system greatly outperforms the existing methods of Discovery of Truth both in terms of performance and efficiency. 
Keywords: Big Data, Rumors, Scalable, Social Media, Truth Discovery, Twitter.
Scope of the Article: Big Data Application Quality Services