Michael Fire

Ben Gurion University, Software & Information Systems Engineering, Faculty Member

University of Washington, Computer Science & Engineering, Post-Doc

Followers

1,823

Following

Co-authors

Public Views

My passion is data. Specifically, I want to use massive datasets in innovative and effective ways to make the world a better place. I am a Senior Lecturer (Assistant Professor) at the Software and Information Systems Engineering Department at Ben-Gurion University of the Negev (BGU). In addition, I founded the Data Science for Social Good Lab. Data science was a central part of my PhD studies, for which I was awarded the Kreitman Prize. Later I was the recipient of the Moore/Sloan Data Science Fellowship and the WRF Innovation Postdoctoral Fellow in Data Science at the University of Washington. My main research interests lie in big data, machine learning, social network analysis, and security and privacy. Also, I have gained extensive hands-on experience as a data scientist working for several companies and organizations.

less

Alexander Kremiansky

Flavia Țăran

Babes-Bolyai University

University of Cambridge

Mark Artyukh

University of Warwick

Richard Mills

Valencia College

Alessandro Di Stefano

Teesside University

InterestsView All (16)

Uploads

Journal Articles by Michael Fire

Using data science to understand the film industry's gender gap

Nature Humanities and Social Sciences Communications, 2020

Data science can offer answers to a wide range of social science questions. Here we turn attentio... more Data science can offer answers to a wide range of social science questions. Here we turn attention to the portrayal of women in movies, an industry that has a significant influence on society, impacting such aspects of life as self-esteem and career choice. To this end, we fused data from the online movie database IMDb with a dataset of movie dialogue subtitles to create the largest available corpus of movie social networks (15,540 networks). Analyzing this data, we investigated gender bias in on-screen female characters over the past century. We find a trend of improvement in all aspects of women‘s roles in movies, including a constant rise in the centrality of female characters. There has also been an increase in the number of movies that pass the well-known Bechdel test, a popular—albeit flawed—measure of women in fiction. Here we propose a new and better alternative to this test for evaluating female roles in movies. Our study introduces fresh data, an open-code framework, and novel techniques that present new opportunities in the research and analysis of movies

Scientometric trends for coronaviruses and other emerging viral infections

GigaScience, 2020

Background COVID-19 is the most rapidly expanding coronavirus outbreak in the past 2 decades. To ... more Background
COVID-19 is the most rapidly expanding coronavirus outbreak in the past 2 decades. To provide a swift response to a novel outbreak, prior knowledge from similar outbreaks is essential.
Results
Here, we study the volume of research conducted on previous coronavirus outbreaks, specifically SARS and MERS, relative to other infectious diseases by analyzing >35 million articles from the past 20 years. Our results demonstrate that previous coronavirus outbreaks have been understudied compared with other viruses. We also show that the research volume of emerging infectious diseases is very high after an outbreak and decreases drastically upon the containment of the disease. This can yield inadequate research and limited investment in gaining a full understanding of novel coronavirus management and prevention.
Conclusions
Independent of the outcome of the current COVID-19 outbreak, we believe that measures should be taken to encourage sustained research in the field.

Over-Optimization of Academic Publishing Metrics: Observing Goodhart's Law in Action

GigaScience, 2019

Abstract Background The academic publishing world is changing significantly, with ever-growing nu... more Abstract
Background
The academic publishing world is changing significantly, with ever-growing numbers of publications each year and shifting publishing patterns. However, the metrics used to measure academic success, such as the number of publications, citation number, and impact factor, have not changed for decades. Moreover, recent studies indicate that these metrics have become targets and follow Goodhart’s Law, according to which, “when a measure becomes a target, it ceases to be a good measure.”
Results
In this study, we analyzed >120 million papers to examine how the academic publishing world has evolved over the last century, with a deeper look into the specific field of biology. Our study shows that the validity of citation-based measures is being compromised and their usefulness is lessening. In particular, the number of publications has ceased to be a good metric as a result of longer author lists, shorter papers, and surging publication numbers. Citation-based metrics, such citation number and h-index, are likewise affected by the flood of papers, self-citations, and lengthy reference lists. Measures such as a journal’s impact factor have also ceased to be good metrics due to the soaring numbers of papers that are published in top journals, particularly from the same pool of authors. Moreover, by analyzing properties of >2,600 research fields, we observed that citation-based metrics are not beneficial for comparing researchers in different fields, or even in the same department.
Conclusions
Academic publishing has changed considerably; now we need to reconsider how we measure success.

The Rise and Fall of Network Stars Draft Article The Rise and Fall of Network Stars

Elsevier Information Processing & Management, 2020

Trends change rapidly in today's world and are readily observed in lists of most important people... more Trends change rapidly in today's world and are readily observed in lists of most important people, rankings of global companies, infectious disease patterns, political opinions, and popularities of online social networks. A key question arises: What is the mechanism behind the emergence of new trends? To answer this question, we can model real-world dynamic systems as networks, where a network is represented by a set of vertices and their corresponding links. The features and topology of these networks can then be analyzed, including how they evolve over a long period of time. However, the actual mechanisms behind these dynamic systems remain difficult to understand. Here we show the construction of the largest publicly available network evolution dataset to date, which we utilized to reveal how key entities in a network gain power. We employed state-of-the art data science tools and extensive cloud computing resources to create this massive corpora that contains 38,000 real-world networks and 2.5 million graphs. Then, we performed the first precise wide-scale analysis of the evolution of networks with various scales. Three primary observations emerged: first, links are most prevalent among vertices that join a network at a similar time; second, the rate that new vertices join a network is a central factor in molding a network's topology; and third, the emergence of network stars (high-degree vertices) is correlated with fast-growing networks. We applied our learnings to develop a simple network-generation model-a flexible model based on large-scale, real-world data. Our results are applicable to dynamic systems in nature and society, and deliver a better understanding of how stars within these networks rise and fall.

Michael Fire

Uploads

Journal Articles by Michael Fire

Journal Articles under Review by Michael Fire

Refereed Conference Proceedings by Michael Fire

Log In