behaviours and purchasing preferences – that clearly distinguish them from customers in other segments.
Sources: https://hbr.org/2014/07/what-you-need-to-know-about-segmentation
http://ijecm.co.uk/ ISSN 2348 0386
https://www.iiste.org/Journals/index.php/EJBM/article/viewFile/647/540
While some broad elements of segmentation, such as age and geography, have been applied in the development sector, the power of behavioural and psychographic segmentation has been largely overlooked. According to Samuel, “psychographics, which measure customers’ attitudes and interests rather than ‘objective’ demographic criteria, can provide deep insight that complements what we learn from demographics” (see Harvard Business Review article on psychographics ). This type of segmentation provides a deeper understanding of the desires, needs, and decision-making considerations of a potential user of a product or service. Applied correctly, it could enhance efficacy of public health initiatives, ensure new products reach the people most likely to need and use them, and increase the likelihood of lasting behavioural change. Using the example of voluntary medical male circumcision (VMMC), this paper shows how segmentation has been applied in development and discusses the challenges in both measuring and adopting segmentation as part of program design.
In an era of constrained funding for HIV primary prevention basics like demand creation for male and female condoms, there is a need to be more targeted with resources in order to reach the right people with the right intervention – a higher priority than reaching all people. Groups sharing common attributes are more likely to respond similarly to a given demand creation strategy, but addressing all men aged between 15–19 as if they are identical is unlikely to result in cost-efficient, relevant, or relatable communication, whether it takes place at an interpersonal (e.g., peer educator) or mass level.
To better understand what the literature reveals about the barriers to and benefits of segmentation, both for VMMC in particular and for demand creation for HIV preventive interventions more broadly, we queried scholarly databases, health and innovation journals, and other publications for studies, analyses, and peer-reviewed articles on segmentation published between January 2015 and September 2018. The types of literature considered included case studies, systematic reviews, meta-analysis, and journal articles. We performed a detailed search of the existing peer-reviewed literature using PubMed , ProQuest , ScienceDirect , Google Scholar , and the abstract directories of the International AIDS Society (IAS). We searched these sources for keywords including HIV prevention, segmentation, demand creation, demand generation, innovation, behaviours, human-centred design, segmentation evaluation, campaign qualitative evaluation, campaign success, campaign failure, target audience, social marketing, campaign lessons learned, and VMMC. Our primary application of interest was VMMC, with a secondary emphasis on HIV prevention programs related to adolescent girls and young women (AGYW). Sub-Saharan Africa was the priority region of interest, with examples elsewhere in low and middle-income countries a subordinate focus. The hundreds of results generated by these criteria were then individually examined and collated by subject-matter experts for relevance to different main aspects of this particular piece of work; full citations were developed for 61 select articles, along with complete summaries of their significance. A second round of keyword searching was conducted, this time expanding sources queried from the listed databases to include premier marketing journals and publications in order to provide more complete information regarding existing applications of segmentation (as an approach predominantly employed and evaluated predominantly within the context of private sector work). The resulting, updated bibliography included 40 sources and was reviewed a second time to produce the comprehensive list of sources (44) which informed initial drafting of the literature review text. After subsequent rounds of writing and editing, including extracting information deemed unnecessary and as a result eliminating some citations, this literature review’s current bibliography has been updated to represent its use of 37 carefully curated sources. It should be noted that this literature review did not include a need to address consent issues.
Our findings indicate that market segmentation has, to date, most often been applied to global health fields that function for profit and, as such, tend to view their clients as customers, with preferences to discover and cater to, rather than as patients with needs that are often assumed to be homogenous within a broadly defined population, such as by diagnosis or perceived need. The latter view is associated with interventions designed from the top down. However, areas of global health with a development focus or non-profit structure and culture have been slower to adopt or apply market segmentation. Moreover, existing published materials on its applications tend not to measure the impact of segmentation itself, but the impact of the intervention to which segmentation was applied, generally through uptake measures.
On top of the traditional difficulties of measuring impact in the global health and development space (e.g., long observation times, difficult data gathering, complex influences on decision-making), segmentation requires additional thought to isolate the impact of the strategies themselves. While we are able to observe that using segmentation strategies offers a framework to design more nuanced and resonant interventions, it is difficult to isolate the impact of segmentation from the many other steps, methodologies, and strategies that are part of a robust human-centred design process. As multi-disciplinary consortiums take on multi-year projects, it becomes increasingly difficult to isolate specific contributions when evaluating overall efficacy.
It is also difficult to measure impact because a segmentation strategy is not a product or a message or an experience on its own; it is a vehicle for developing them. In this way, the segmentation step is not tested directly, but indirectly through the things it produces – making it challenging to draw an objective cause and effect relationship.
Because the value and impact are difficult to isolate, segmentation is often overlooked or attempted unsystematically. Sgaier et al. determined that approaches to demand generation were inconsistent, not evidence-based, and poorly coordinated. This work goes on to say that “political and social factors, including ignorance of the need for strategic demand generation, may contribute to inadequate funding and focus.” The authors found that there was scant evidence on approaches to demand generation for VMMC, both in terms of understanding drivers of demand and in terms of evaluating existing interventions 4 .
In a later article for the Stanford Social Innovation Review, Sgaier et al . expanded on these issues, citing particular cases. For example, in Niger, a limited understanding of the value of segmentation at the highest levels caused discussions with governments and partners to drag on for more than three years before segmentation could begin to be implemented. Even once segmentation is underway, Sgaier et al . note myriad issues, including restricted ability to design the research, limited number of people with experience in the segmentation process, and difficulty transferring the findings into large-scale programs 2 . In the case of HIV prevention, the psychographic measures of risk perception and belief about the efficacy of a particular intervention proved to be a more effective approach to the segmentation of men into target audiences than basic demographic distinctions. Designing a program of social promotion that is tailored for groups based on these values, attitudes, and decision-making requirements creates the greatest likelihood for change in HIV-related behaviours among each segment 5 .
Meanwhile, Terris-Prestholt and Windmeijer looked at interventions that promote behaviour change. They determined that the impact of interventions aimed at the ongoing behaviours that are relevant to prevention are slow to take hold and should therefore be evaluated over a longer timeframe 6 . However, current funding mechanisms generally do not allow sufficient time for such change to take place and be observed, making interventions based on segmentation hard to evaluate.
Though there are difficulties in exact measurement, the most compelling evidence for the efficacy of segmentation is to look at when and how it is used. Specifically, it is often brought in when nothing else is working or when there is a drop-off in the efficacy of an intervention. This typically occurs when generalized, homogenous efforts have reached the most easily persuaded among the target audience and nuanced interventions are needed for more resistant members of the target audience. This was the case with VMMC, where early adopters of the procedure had been reached and demand was plateauing ( Figure 1 ).
Source: https://www.avac.org/sites/default/files/resource-files/AVACreport2018.pdf . This graphic has been reproduced with permission from AIDS Vaccine Advocacy Coalition (AVAC).
Segmentation can also be used to investigate why efficacy has been uneven. The knowledge gained from segmentation can be used to design specific interventions (e.g., messaging, experiences, campaigns) in situations where demand is lagging, or to prioritize outreach to specific groups if resources are constrained. This is especially true when used in conjunction with human-centred design, another methodology increasingly applied in global health programs when traditional strategies start to produce declining returns on investment.
Despite challenges, segmentation also offers a large range of opportunities for the sector. The Reproductive Health Supplies Coalition is one of a growing number of organisations in development that acknowledge that segmentation provides empirical evidence which can help guide the most efficient and effective use of resources. The Coalition states that “market segmentation data can shape family planning…(and) it can increase market efficiency for the government stewards of public resources” 7 . It goes on to say that policymakers can use market segmentation research to “draft evidence-based policy initiatives, giving government officials a more useful context to decide which policies are worth enacting.”
The identification of segments can also guide decisions about how to meet performance and delivery objectives within a health system. For example, segmentation studies can identify whether one group of end users will likely access a prevention service in a private clinic rather than a public one. Most obviously, segmentation can be used to guide communication at all consumer touchpoints from the community to the clinic, and on a mass scale.
The example of VMMC lends itself well to a segmentation approach. The audience of those who may undergo the procedure already encompasses distinct subdivisions by age (e.g., early infant male circumcision (EIMC), early adolescent male circumcision (EAMC), “catch-up” population segments comprised of older men). The literature is replete with examples of such demographic segmentation and its applicability to developing more effective promotions, including via social marketing 8 – 13 . Notable examples include the Kingdom of Eswatini’s successful VMMC program, which prioritised EIMC as the “sustainment” component of a comprehensive set of VMMC interventions for multiple age brackets 14 ; Lane et al. ’s supplement of nine studies across South Africa, Zimbabwe, and Tanzania, which targeted 10–14-year old adolescent males through messaging around key incentives or barriers to VMMC uptake (motivation, counselling, wound healing, parental involvement, female peer support, quality of in-service communication, and providers' perceptions) 15 ; and an observational prospective intervention study in the Orange Farm township of South Africa, which successfully obtained male circumcision prevalence of 80% among adult men within just three months 16 . Modelling investigations likewise find age to be a beneficial, and in some cases particularly cost-effective 17 , 18 , basis for market segmentation 19 , 20 . Geographic segmentation is similarly common 13 , 17 – 19 , 21 , and through modern mapping technology can offer novel applications 21 , 22 .
Importantly, however, recent work also emphasizes the need for segmentation that goes beyond age to distinguish among individuals’ perceived motivations or disincentives for VMMC, as well as beyond the candidate population for VMMC to highlight the role of decision-making “influencers” (i.e., female partners, family members like parents, grandparents, and parents-in-law, trusted community leaders like sports team coaches), who may be effectively targeted through social marketing promotions of VMMC to encourage its uptake among the men in their lives 23 . Figure 2 depicts a representation of segments applied to VMMC in Zambia.
Source: https://healthcommcapacity.org/wp-content/uploads/2017/06/Albert-Machinda-Society-for-Family-Health.pdf This graphic has been reproduced with permission from The Bill and Melinda Gates Foundation.
The work to segment Zambian and Zimbabwean men along behavioural and psychographic lines provides the most straightforward example of a non-age-based population dissection with findings readily applicable to social marketing interventions. In this case, men were segmented in alignment with factors that motivated and/or supported them on a personal level to undergo VMMC. Men were also segmented according to influences at the community or structural level that discouraged or encouraged uptake 24 . Another important finding from the “influencer” cluster of studies relates to how campaigns are conducted after segmentation. VMMC candidates, perhaps due to the intimate nature of the decision and procedure, exhibit a strong preference for individual over mass communication on this issue 9 , 25 . This preference is borne out by both the insignificant results yielded in a study which leveraged the mass communication platform of SMS to deploy VMMC-related information and counselling 26 , as well as by two unsuccessful VMMC promotion case studies which cited a lack of consideration for sociocultural context (such as including the perspectives and gaining the support of “traditional leaders, healers and circumcisers”) as a reason for failure 27 . In demand creation campaigns, personal counselling or one-to-one approaches favourably impacted willingness to undergo VMMC or to consider it for one’s dependents. Quality market segmentation conducted before beginning such an individualised intervention has the potential to make this otherwise costly and labour-intensive – yet highly effective – approach more feasible to implement 9 , 16 , 28 .
Market segmentation can also identify whom not to target. A 2015 analysis “explored correlates of male circumcision status among men and their social, economic, health and sexual behaviour factors.” This analysis provided characteristics for better targeting and intervention design. In this case, limited resources for uptake campaigns could be directed toward populations of greatest need and minimize the use of resources directed at segments that were unlikely to choose VMMC under any circumstance 29 .
There is also some evidence for market segmentation’s value in creating demand for HIV preventive services outside of VMMC. Cremin et al . use a mathematical model to isolate certain subsets of Nairobi’s population in which HIV incidence is on the rise, in contrast to its trend of decline at the city level, and to suggest optimal interventions for reducing HIV infection among these high-risk groups 30 . Reed et al . extrapolate lessons learned from VMMC scale-up in the region to support oral pre-exposure prophylaxis (PrEP) expansion among another targeted market segment 31 . Also addressing the AGYW audience, Celum et al . explore how social marketing and innovative market segmentation can increase demand for and optimise uptake and effective use of PrEP 32 ; Eakle et al . point to this potent combination as a particularly useful means of enhancing PrEP demand generation among more sceptical communities 33 ; and Luecke et al . examine the demographic and behavioural correlates of preferred PrEP formulations, arguing that a deeper understanding of women’s product preferences can guide not only product development but also drive demand creation through social marketing 34 . Sgaier, too, opines from family planning work in India that psychographic-behavioural segmentation can better forecast demand and, “as in the private sector, a staged market launch that actively stimulates uptake can be used to match appropriate products to suitable customers” (see devex article on contraception in Uttar Pradesh women ). Ayikwa and Jager advocate for social marketing as the “ultimate weapon” in combating HIV/AIDS transmission and overcoming related stigmas 35 .
All these papers corroborate Rao and McCoy, who state that “behaviour change isn’t just about crafting the perfect message; creating better programs requires really listening to and understanding the patient experience” [ https://ssir.org/articles/entry/fostering_behavior_change_for_better_health#bio-footer ]. They stress that “borrowing tools from the private sector…to understand, track, and influence customers can greatly enhance global health programs that require changes in attitudes or behaviour.”
The literature collectively drive toward the idea that what the public health sphere needs is a new mindset. Instead of viewing its target audience as patients with diagnoses, the audience should be seen as multidimensional consumers with preferences and needs. Current public health segmentation has been almost solely demographic; though valuable at a basic level, this is rudimentary 36 . Research marketing pioneer Daniel Yankelovich states in the Harvard Business Review that demographic segmentation “implies that differences in reasons for buying, in brand choice influences, in frequency of use, or in susceptibility will be reflected in differences in age, sex, income, and geographical location. But this is usually not true” (see Harvard Business Review article on market segmentation ). Basically, age segmenting can only be useful as a very general indication of patterns of behaviour, as not everyone in the same age band will behave the same way or respond the same way to experiences. Even “geodemographic classifications such as ACORN (a classification of regional neighborhoods), while useful for indicating likely very general patterns of spending power, do not reveal the absurd assumption that everyone…drives the same car, reads the same newspapers, eats the same food and so on” (see The Marketing Journal article on market segmentation ). Demographic and age segmentation are some of the easiest to develop, but they provide limited guidance. In order to successfully effect change, implementers need to connect deeply with their audience by looking past its superficial characteristics to the attitudinal, behavioural, and contextual factors that guide its members’ decision-making. This does not mean that public health solutions should be fragmented, but rather that precise messaging – still framed in the broader context of health goals – can be developed for and directed to audiences with whom it is most likely to resonate strongly 4 .
At this stage, it is not yet possible to definitively conclude that market segmentation leads to measurably better HIV prevention results, but it can be asserted that market segmentation leads to interventions with measurably better HIV prevention results. The literature present ample evidence for the value of market segmentation as a component of demand creation for HIV prevention interventions, including VMMC and, more recently, oral PrEP. While traditional applications of market segmentation in healthcare, such as age and geography, remain useful as components of more nuanced population stratifications, behavioural-psychographic segmentation presents the greatest potential for efficacy in uptake of HIV prevention measures, both broadly and in the case of VMMC specifically. In a later article, also for the Harvard Business Review, Yankelovich and Meer write that “non-demographic segmentation began more than 40 years ago as a way to focus on the differences amongst customers that matter most strategically” 37 . Ultimately, what the public health sphere needs is a shift: from considering possible users of its products as a single group with the same desires and behaviours based on age or location to seeing them as multidimensional consumers with individual preferences and needs.
Acknowledgements.
Thanks are extended to Jeanne Baron, Mitchell Warren, and Emily Bass from AVAC for their assistance with manuscript preparation and helpful comments throughout the process.
[version 1; peer review: 2 approved, 1 approved with reservations]
This work is supported by the Bill and Melinda Gates Foundation [OPP1161329]. The AIDS Vaccine Advocacy Coalition’s (AVAC) product introduction and access efforts are supported through grants from the Bill and Melinda Gates Foundation, via the HIV Prevention Market Manager grant agreement [OPP1135316], and by the generous support of the American people through the U.S. President’s Emergency Plan for AIDS Relief (PEPFAR) and the U.S. Agency for International Development (USAID), via the OPTIONS Consortium Cooperative Agreement No. AID-OAA-A-15-00035). The content of this article is solely the responsibility of the authors and does not necessarily represent the views of the Gates Foundation, PEPFAR, USAID, or the United States Government. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Jason b. reed.
1 Jhpiego, Baltimore, MD, USA
Sema sgaier.
2 Surgo Foundation, Washington, D.C., USA
3 Harvard T.H. Chan School of Public Health, Boston, MA, USA
4 University of Washington, Seattle, WA, USA
The referee team finds the article to be an important contribution to the field because it reinforces the potential value of an underappreciated approach to global development/non-profit public health programming, specifically related to program uptake. A case is made that the private sector is better at market segmentation. Development/public health donors often prescribe demographic and behavioral sub-populations—sex, age, geographic area, wealth quintile—for program services based upon epidemiology and return on investment expectations. Implementers may simply resort to the same or similar sub-populations for structuring demand creation campaigns. Going beyond these conventional public health categories to include psychographic and behavioral segments may represent an important opportunity for greater cost-effectiveness in program execution, by more efficiently influencing uptake. However, the authors stop short of describing how the private sector (more) successfully determines psycho-behavioral market segments and acts upon them in selection of messages and channels. This may be because little is published on methodologies from the private sector, which has little incentive to share proprietary approaches. This may well be beyond the scope of this paper, but some discussion of bases of segmentation could be helpful to the reader to understand the concept. As it is, the reader is left with the vague feeling that “everyone is different” and therefore all messaging must be micro-tailored, which would be even less efficient/cost-effective than existing approaches.
The article could be improved by defining terminology upfront, e.g., demographic, geographic, behavioral vs. psychographic market segmentation. A strong definition of segmentation, including the four qualities of viable segments, is provided here for further consideration:
“Heterogeneous markets are divided into homogenous markets by identifying what are known as the appropriate bases of segmentation. An appropriate segmentation base might be age in a
market where young people prefer a different product or service than old people. Sex, socioeconomic status, and residence are common segmentation bases due to assumptions about differing preferences between males and females, the rich and the poor, and the urban and rural. Many other segmentation bases exist, including psycho-graphics and lifestyle. In marketing, the appropriateness of one or more segmentation bases is evaluated in terms of whether each basis is identifiable (able to be measured), actionable (theoretically able to be changed through an intervention), accessible (defines populations able to be reached), responsive (empirically are changed by an intervention), substantial (large in size), and stable (do not change for a period long enough to design, monitor and evaluate an intervention)” Frank et al. (1972 1 ).
The details of parameters for the literature review are clear. The number of professional disciplines of subject matter experts reviewing and summarizing the findings is less clear, though presumably comprised of some or all of the authorship team. Given the disconnect between private and public sector approaches to segmentation, it would be valuable to know whether interpretations of relevance and importance (to impact and as specifically relates to voluntary medical male circumcision) varied by background of the experts.
The main conclusions appear to be framed in relation to public health intervention uptake (or adherence). However, the other a priori interests stemming from market segmentation that may be of equal (or arguably even greater) importance may take precedence, e.g., using segmentation to identify services/products more aligned with potential users preferences, demand forecasting, and preference of where to access products and services (public vs. private sector, formal vs. informal outlets). If market segmentation were more thoughtfully used earlier in processes, segmentation might not be so frequently the approach of last resort. In turn, judging segmentation impact might be broader than solely focused on improvements in uptake.
We have read this submission. We believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
1 DesireLine LLC, Istanbul, Turkey
- Clarify the forms of market segmentation throughout the paper to be specific about which forms of segmentation are being discussed in each point made on its use
- Clarify segmentation as a means to dividing the market and identifying the differences to then target differential interventions/services/products vs. as a method itself to understand and intervene.
I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
1 CollaborateUp, Washington, DC, USA
The title includes both segmentation and social marketing but social marketing is not defined in the article and is not discussed at any length. I would remove social marketing from the title, or have it play a bigger role in the article (and define it upfront or in a table as has been done with segmentation).
While the authors state that segmentation was not measured extensively, there are too few examples for any reader to try to replicate any work done with segmentation. The work of VMMC, which is used as the main case study for the piece, includes insufficient detail for replication and does not include specific data for Zimbabwe or Zambia. How did behavioral segmentation impact uptake of VMMC in these two countries? What were the challenges of executing against the segmentation results. What was the impact of activities across each of the 7 segments in Zambia? Can additional data on impact and/or details on execution be included for these two countries? Even the VMMC studies mentioned don't seem to point specifically to the examples in Zambia and Zimbabwe.
The definitions of quality market segmentation in this paper are too vague, making the prospect of applying it to a program very difficult. Table 1 provides characteristics of useful market segments but additional detail or examples are needed. Table 1 states that segments should be substantial enough to be profitable or to meet the financial needs of a company. While this works for the commercial sector, it is unhelpful for development work which is typically not about profitability or even cost effectiveness. Can authors include examples of a 'substantial' size that would be applicable and relevant to HIV prevention? In addition, "sustainable" is also used to describe a useful market segment, stating that the segment should be stable enough for a long-enough period of time to be marketed strategically and warns off using something like "lifestyle' which too easily fluctuates. But what is a long-enough period of time to be marketed strategically? Do attitudes and behaviors shift less frequently than 'lifestyle? If not using age or demographics, what type of attributes should be used that will not fluctuate easily and how long do those attributes need to be stable? Better defining or providing concrete examples from public health segmentation (or the commercial sector) will strengthen the piece and make it more relevant to implementers.
The article seems at times to conflate demand generation and demand creation with segmentation. In paragraph 5 of the Results section concludes that segmentation is often overlooked or unsystematic, using the Sgaier article's criticism of poor demand creation approaches as evidence, but these seem two different things. Demand creation does not necessarily include behavioral segmentation. While it may include demographic segmentation, that is not the focus of the piece. TheTerris-Prestholt and Windmeijer article on behavior change interventions is also cited as a reason why segmentation would be hard to evaluate and measure, but the conclusion seems forced unless segmentation is part of the reason behavior change interventions take a longer time to take hold.
I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
Scientific Data volume 11 , Article number: 882 ( 2024 ) Cite this article
Metrics details
Recent advances in computer vision and deep learning techniques have facilitated significant progress in video scene understanding, thus helping film and television practitioners achieve accurate video editing. However, so far, publicly available semantic segmentation datasets are mostly limited to indoor scenes, city streets, and natural images, often ignoring example objects in action movies, which is a research gap that needs to be urgently filled. In this paper, we introduce a large-scale, high-precision semantic segmentation dataset of props in Chinese martial arts movie clips, named ChineseMPD. Specifically, this dataset first establishes segmentation rules and general review criteria for audiovisual data, and then provides semantic segmentation annotations for six weapon props (Gun, Sword, Stick, Knife, Hook, and Arrow) with a summary of 32,992 objects.To the best of our knowledge, this dataset is the largest semantic segmentation dataset for movie props to date. ChineseMPD dataset not only significantly expands the application of traditional tasks of computer vision such as object detection and scene understanding, but also opens up new avenues for interdisciplinary research.
Background & summary.
Semantic segmentation and scene understanding and have become important research directions in the field of computer vision 1 , 2 , 3 , 4 , 5 , 6 , with applications covering a wide range of areas including autonomous driving 6 , surface crack detection 1 , and so on. Among these task, semantic segmentation is crucial, as recognizing and classifying different instance objects at the pixel level is one of the most important methods for scene understanding. Chinese martial arts movies provide a unique and fascinating research area for semantic segmentation due to their rich visual and cultural heritage. These films are characterized by complex fight scenes and iconic props, elements that play a crucial role in narrative and aesthetics. However, although several seminal datasets such as Cityscapes 7 , PASCAL VOC 2012 8 and COCO Stuff 9 datasets are available, they often focus on semantic segmentation of urban scenes or natural scenes, neglecting prop segmentation in movie scenes. Currently, semantic segmentation of props in Chinese martial arts movies is still challenging due to the lack of benchmark datasets.
Existing movie datasets can be divided into two categories. One category is movie description datasets. Heilbron et al . 10 introduced ActivityNet, a movie video dataset for understanding human activities. The dataset contains 203 videos of different categories that can be used for human activity understanding, e.g., video categorization or activity detection. Tapaswi et al . 11 proposed the MovieQA dataset for purposes aimed at evaluate the ability of algorithms to automatically understand video and text stories. It contains 14,944 multiple-choice questions and the corresponding 5 multiple-choice answers. Huang et al . 12 used MovieNet, a multimodal dataset for movie comprehension, which contains annotations corresponding to different aspects of descriptive text, location and action labels, and detection frames. Although these video scenes are rich, they lack corresponding segmentation labels and are often used for classification tasks such as video classification, scene recognition and sentiment analysis.
There are very few available datasets for segmentation of video objects, except for movie descriptions. Pont-Tuset et al . 13 proposed DAVIS, a public dataset and benchmark designed specifically for the task of video object segmentation.DAVIS contains dense semantic annotations for different objects in different life scenarios. Wei et al . 14 proposed an actor-centered dataset for video object segmentation YouMVOS. This data is labeled only for the segmentation of multiple shots of the actors themselves. Similarly, Ding et al . 15 proposed MOSE, a semantic segmentation dataset containing 5,200 video objects in 36 categories. This dataset aims to explore the ability of artificial intelligence (AI) algorithms for video object segmentation of common objects in complex scenarios. In summary, there is no publicly available dataset for martial arts props, and the only existing publicly available dataset for semantic segmentation of video objects not only differs from martial arts props in terms of object shapes and sizes, but also is not applicable to semantic segmentation of martial arts props.
To address these gaps, this paper introduces ChineseMPD, a semantic segmentation dataset of props from classic Chinese martial arts movies. ChineseMPD provides pixel-level annotations for six categories, including Gun, Sword, Stick, Knife, Hook, and Arrow. Fine annotations are provided for the props in the images through fine-grained annotation and strict review process to ensure the high quality and authenticity of the dataset. The pipeline of our proposed Chinese martial arts film props dataset can be seen from the Fig. 1 . Based on the video data of 8 action movies segments, the dataset provides a summary of 32,992 objects with fine annotations showing different scenes (e.g., fight scenes, training scenes, ritual scenes, rest scenes, and market scenes). The selected clips from Chinese martial arts films feature unique action sequences. Through continuous narrative blocks and a series of individual shots, these clips ultimately present the “chivalrous” plot, which combines aesthetics and storytelling. However, particularly with continuous frame images that contain narrative elements, it is challenging to annotate film clips using automated models. Firstly, due to the limitations of composition, shooting angles, and lighting, the model cannot distinguish between deliberate blurriness and occlusions. Secondly, the differences between continuous frame labels are significant and rely heavily on contextual semantic understanding, making it difficult to correlate the rich semantic information in films. Thirdly, the dynamic object labels within the film clips are often neglected. Our dataset employs fine-grained semantic segmentation techniques to deeply annotate props in Chinese martial arts films. In addition, we have established relevant rules and comments for the extraction and annotation of film clips. In order to more clearly visualize the content of our marks, Fig. 2 provides the specific distribution of various props in A-H movie clips.
Pipeline of Chinese martial arts film props dataset. ( A ) Data selection. Select the movie slice with plot and props to determine the category of semantic segmentation; ( B ) Rule establishment. Different colors are used to distinguish the semantic segmentation categories, and the rules for labeling props are established. ( C ) Data labeling. Label the props in film clips. ( D ) Data review. Experienced three more rigorous reviews.
Number of props annotations per clip. A–H represent 8 different Chinese martial arts classic movie clips.
Our dataset provides a new perspective to explore and analyze the complex interactions and dynamic changes in videos by complementing the existing semantic segmentation of movie objects. In addition, the establishment of the dataset promotes research in the field of computer vision on cutting-edge technologies such as motion recognition, scene reconstruction, and virtual reality, which offers the possibility of realizing a more intelligent and automated film and television post-production process. Meanwhile, it also provides rich materials for interdisciplinary research, promotes the integration of AI with cultural analysis, historical research and other fields, and opens up new ways for the digital protection and innovative inheritance of traditional cultural heritage.
This section reviews the process of our collection and implementation of the dataset of Chinese martial arts film props. The labeling of props was performed manually with AI assistance, which will be elaborated on in this section. At the same time, we have also established a special annotation and review method for the film props dataset.
A total of 21 people participated in data labeling and review. Among them, the data labeling personnel is composed of 11 undergraduates, and the reviewing personnel is composed of 3 junior auditors, 2 senior auditors and 5 acceptance personnel. The data labeling personnel received 7 days of theoretical training and labeled 2000 images. The data reviewing personnel conducted research and discussion on international standards and segmentation requirements in the early stage, and gained a comprehensive understanding of video labeling. Moreover, the participants jointly formulated the specification and criteria for the annotation and review of the dataset, such as the review rules for object contours and fuzzy images in the annotation of props.
We selected film clips from the China Film Archive ( https://www.cfa.org.cn ) and Zhejiang Communication Television Art Archive ( http://ysys.cuz.edu.cn ) for labeling of film props. The resolution of selected films is 2560 × 1440 and 1920 × 1080. The selection process for film clips starts in October 2021 and ends in March 2022, and the annotation process starts in October 2021 and ends in August 2022. These clips were carefully selected following copyright reviews and are used solely for academic research purposes, adhering to academic standards. Specifically, according to Article 22 of the Copyright Law of the People’s Republic of China, the limited use of published works for teaching and research purposes is permitted under specific conditions. There are no copyright issues involved. Finally, we selected eligible clips from more than 700 movie clips to build our dataset 16 , each clip is about 2 minutes long. It is important to note that the total number of labeled props varies depending on the specific requirements of each film clip and the relevant scene. Swords and knives have more counts than others among the various props.
The plotshot division of the film clip was completed by a graduate student with rich editing experience. The film clip was roughly and finely cut with Adobe Premiere, the film image and audio were aligned, the subtitles of which blocked the images were removed. Finally, the movie clips are extracted at a rate of 4 frames per second. Each movie clip is saved as a JPG image with a resolution of 1920 × 1080 pixels.
To ensure the quality and time consuming are within our affordable range, we used interactive annotation tool EISeg 17 to label frame images with high accuracy, which is able to annotate images in an interactive way. In addition, EISeg embedded segmentation algorithms for both the coarse and fine granularity levels, which facilitate the annotation procedure. As shown in Fig. 3 , this method can generate an annotation mask, it is convenient to adjust the polygon vertices of the mask to further improve the accuracy.
Illustration of annotated images. The annotated items are distinguished by different colors, and the edge annotation points are connected into a semantic segmentation outline.
As shown in Fig. 4 , to make our marked props more clear and more distinct, we have made corresponding annotation examples for the content interacting with the props. The dataset we provide also has relevant labels for obvious characters and scenes, which can provide research references. Specifically, through the previously selected label, the props are manually marked and the edges are corrected. Each segmented shot is annotated with contours referencing the format used in the COCO dataset. The annotations and corresponding labels are subsequently stored in JSON format once the contours are finalized. The default path for saving is the new label folder in the dataset folder, where JSON files of all marked points are stored at the same time. For the reason that it is not necessary to perform semantic segmentation by edge tracing points, but by clicking any part of the props, the method propsed by Benenson et al . 18 was used to generate the mask. In order to ensure the quality of annotation, we have also established a set of annotation checking specifications for the segmentation of film and television elements and props, making our dataset 16 more reliable.
Semantic segmentation dataset annotation of props in film clips. The props are Sword, Stick, Hook, Arrow, Knife and Gun. From left to right, they are Original, Colour mask and Foreground. The colour mask unifies the contents of the removed labels into a blue background; Foreground realizes the visualization of annotation types.
The semantic segmentation types of martial arts film props include Knife, Sword, Gun, Stick, Hook, and Arrow, as shown in Table 1 . In the actual annotation process, we have established a set of standards for props semantic segmentation dataset annotation checking, as shown in Fig. 5 . The composition of the audit team is as follows: primary audit (3 persons), senior audit (3 persons), and senior management (2 persons) to audit and correct the annotation effectively. Advanced audit returns unqualified datasets and corrects them according to the established rules, which can be stated as:
Pipeline of props annotation and corresponding checking specifications. ( a ) Procedure of props annotation. ( b ) Procedure of annotation inspection.
For the area of the image to be annotated, we define an indicator called Pixel Boundary Error (PBE), which can be formulated as: \(PBE=\frac{a\cap b}{a\cup b};\) where a ∩ b is the area of overlap between the actual props to be labeled and the presumed props; a ∪ b is the sum of the area for current labeled props and the actual props. Specifically, we require a PBE of not more than 0.75 for stationary objects and less than 0.85 for moving objects. The examples of unqualified annotations can be seen in Table 2 .
This section summarizes the entire processing flow of our dataset. The dataset 16 is open to the public and provide the necessary ways to use and organize data. Researchers can register with ScienceDB to access the FTP download link to our dataset. For ScienceDB account authentication and registration procedures, see https://www.scidb.cn/en . The link to access our dataset is https://www.scidb.cn/en/anonymous/SlpaelFy .
The film selection is related to each subsequent step, we chose Chinese martial arts film clips to build the dataset. There are a lot of fighting scenes in Chinese martial arts film clips, and the props interact with people to a large extent. The choice of martial arts props can also cover China’s classic weapons. The martial arts films we first selected are shots lasts about 2 minutes, no shading and blurring are shown in these shots, which ensures the authenticity of the screen. Based on selected shots, the movie clip images are then extracted under the a rate of 4 frames per second for one shot.
Film props account for a small proportion of the film screen. Although the semantic segmentation model has achieved high accuracy in today’s increasingly developed in-depth learning, to ensure that the quality of the dataset is under control, we invited experts with specialized technical backgrounds to participate in the project for early assessment.
The organized dataset is composed of several folders. Each folder contains a specific sequence of data. For each of image, three labeled images will be generated, namely the pseudo color image, the grayscale image, and the cutout image. The purpose of generating three labeled images is to increase the intuitive understanding of the segmented image. The annotation points of the image are presented in a JSON file. As shown in Fig. 6 , we name the similar movie clips as the same ID, such as “m00x”. “m00x_fen” is a frame image folder formed after four frames per second segmentation at the shot level. The same level also contains descriptions of segmentation specifications; It is described by “m00x_label.txt” (label semantics and numerical matching relationship information) and “m00x_details.xlsx” (details of labels and related descriptions). The next level is the dataset of x shots, named “m00x_fen00x_dataSet”, which contains m00x_ fen00038_00000001.jpg (the original frame used for annotation) and a label folder, which is used to store the annotated data. Each original frame image contains three labeled images, as well as a JSON file named “annotations.json” with the information about annotation points.
Folder level. The entire dataset is divided into four levels. The upper part is the description of the dataset, and the lower part is the actual name. The sections marked black are folders, and the others are in corresponding formats.
As for the technical verification of dataset, a technical team composed of 5 experts conducted manual checking and sampled the labeled props at the interval of three images. Each expert conducted an independent visual inspection of all labels to ensure the accuracy. The props in this study were manually annotated, so a set of strict technical methods is also established during the inspection. This method takes into account the continuity and consistency between the frames and establishes a judgment standard for the possible fuzziness of props. Images with high dynamic blur are not considered, because they are not useful for the content that may be studied.
During the technical inspection, we use the annotation tool EISeg to visualize the labeled image stacked on the original image, which can help us not to omit other information during the inspection. In addition, we have specified relevant parameters setting for annotation tool before labeling, which can reduce the variance in manual operation. Specifically, the technical standard of sampling can be described as: for dynamic fuzzy or high-speed moving annotation objects, the pixel error is within 3 pixels. For ordinary annotation objects such as props interacting with the scene or stationary props, the pixel error is within 5 pixels. The parameters of EISeg are set as follow: the segmentation threshold is set as 0.5, the label transparency is set as 0.75, and the visualization radius is set as 3.
For quality assurance, five experts will conduct a total of two rounds of visual inspection of the labeled masks according to the above standards, and the labeled images that do not meet the standards will be re-labeled. Table 2 shows the images which do not meet the standard and the corresponding problems, number of error pixels and items. Both two rounds of expert inspection need to meet the above criteria to ensure that the final high quality labels and usage availability. Fig. 7(a),(b) shows the error bars before and after the expert team’s verification in first and second inspection round. It can be conclude that after two rounds of expert inspection, the annotation errors of all classes of props in our dataset reduced from the 2-4 mm to 1.5-3.5 mm, which demonstrates the validity of our proposed criteria, and also proves the effectiveness and great contribution of expert inspection in indicating the quality of the labeling of our dataset.
Boxplots of annotation errors in all classes of props. ( a ) The first round inspection by the expert group; ( b ) The second round inspection by the expert group.
In order to prove the validity of our proposed dataset, four classical semantic segmentation models: DeepLabv3+ 19 , FCN 20 , PSPNet 21 , and SegFormer 22 were applied to evaluate the four semantic segmentation metric of aAcc, mIoU, mAcc and mDice, respectively. The definition of these metrics can be formulated as:
where T P , T N , F P , F N stand for the true positive, true negative, false negative, and false negative, respectively. N stands for the total number of classes. ω c stands for the total number of pixels in the class c . It it worth noting that both aAcc and mAcc show the average pixels classification performance of the model. However, the former does not take into account the difference in the number of pixels in different classes, while the latter uses the number of pixels in different classes to perform a weighted calculation. Moreover, both mIoU and mDice measure the average overlap between the model’s prediction results and the real labels in each category, which can well reflect the classification accuracy of the model at the pixel level. The difference is that mDice is less sensitive to noise and boundaries because it focuses more on the overall overlap; mIoU, on the other hand, is more sensitive to the boundary region, because the calculation of the union includes the region of the prediction error.
For the evaluation performance of the baseline models, It can be seen from the Table 3 and Fig. 8 that the performance of each baseline model is different for various metrics. It is worth noting that all the baseline models were not fine-tuned in our dataset, but were directly evaluated using the model weights pre-trained on other datasets to obtain the metric performance. For aAcc, the performance of each model exceeds 94%, showing strong foreground-background pixel classification ability. However, for mIoU and mDice, the evaluation performance of each model is significantly reduced, indicating that our dataset face challenges in dealing with complex, diverse, and culturally specific foreground item segmentation. These challenges can stem from items blocking each other, small size, high visual similarity, and a lack of clear boundaries, especially when the item does not have a high contrast to the background.
Semantic segmentation evaluation performance for four popular baseline models. The original images and corresponding ground truth are also given. Quantitative results show that the SegFormer performs the best.
In terms of advantages, our dataset provides detailed labeling of traditional Chinese martial arts props, which poses new challenges for developing refined high-precision small-target semantic segmentation models. In addition, our dataset shows a high degree of cultural relevance, which creates a basis for the development of future style-specific generation models and models that require the recognition of specific cultural objects.
Our open data complies with the license statement under CC BY 4.0, and the datasets 16 should refer to this article when using or referencing research objects. The license allows readers to distribute, remix, tweak and build works, but not to use the dataset for commercial purposes. Researchers using this dataset are required to provide a link to this License Agreement and indicate whether modifications have been made to the original work. We hope that ChinesePMD dataset will be available to more researchers and encourage more authors to publish their optimized codes and models, which will contribute to the development of semantic segmentation research in the film and television industry.
Our datasets 16 is public available in Science DB ( http://www.scidb.cn/en ), input link ( https://www.scidb.cn/anonymous/SlpaelFy ) or https://doi.org/10.57760/sciencedb.07008 . The JSON file is in the label folder under the lens dataset. In the label folder, you can also see some visual annotation content after segmentation. If you want to use it, you can directly call the JSON file named “annotations.json” in each dataset. The code for technical validation is public code, the dataset is accessible via the DOI of science DB, and the software EISeg for annotation is open source.
Siriborvornratanakul, T. Downstream semantic segmentation model for low-level surface crack detection. Advances in Multimedia 2022 , 3712289 (2022).
Article Google Scholar
Nilsson, D. Data-efficient learning of semantic segmentation. Lund University (2022).
Bressan, P. O. et al . Semantic segmentation with labeling uncertainty and class imbalance applied to vegetation mapping. International Journal of Applied Earth Observation and Geoinformation 108 , 102690 (2022).
Kittipongdaja, P. & Siriborvornratanakul, T. Automatic kidney segmentation using 2.5 d resunet and 2.5 d denseunet for malignant potential analysis in complex renal cyst based on ct images. EURASIP Journal on Image and Video Processing 2022 , 5 (2022).
Article ADS PubMed PubMed Central Google Scholar
Monasterio-Exposito, L., Pizarro, D. & Macias-Guarasa, J. Label augmentation to improve generalization of deep learning semantic segmentation of laparoscopic images. IEEE Access 10 , 37345–37359 (2022).
Abdigapporov, S., Miraliev, S., Kakani, V. & Kim, H. Joint multiclass object detection and semantic segmentation for autonomous driving. IEEE Access 11 , 37637–37649 (2023).
Dataset, C. Semantic understanding of urban street scenes. Germany: City Shapes (2016).
Everingham, M., Gool, L. V., Williams, C. K. I., Winn, J. M. & Zisserman, A. The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88 , 303–338, https://doi.org/10.1007/s11263-009-0275-4 (2010).
Lin, T. et al . Microsoft COCO: common objects in context. In Fleet, D. J., Pajdla, T., Schiele, B. & Tuytelaars, T. (eds.) Computer Vision - ECCV 2014 - 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V , vol. 8693 of Lecture Notes in Computer Science , 740–755, https://doi.org/10.1007/978-3-319-10602-1_48 (Springer, 2014).
Caba Heilbron, F., Escorcia, V., Ghanem, B. & Carlos Niebles, J. Activitynet: A large-scale video benchmark for human activity understanding. In Proceedings of the ieee conference on computer vision and pattern recognition , 961–970 (2015).
Tapaswi, M. et al . Movieqa: Understanding stories in movies through question-answering. In Proceedings of the IEEE conference on computer vision and pattern recognition , 4631–4640 (2016).
Huang, Q., Xiong, Y., Rao, A., Wang, J. & Lin, D. Movienet: A holistic dataset for movie understanding. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part IV 16 , 709–727 (Springer, 2020).
Pont-Tuset, J. et al . The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 (2017).
Wei, D. et al . Youmvos: an actor-centric multi-shot video object segmentation dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 21044–21053 (2022).
Ding, H. et al . Mose: A new dataset for video object segmentation in complex scenes. In Proceedings of the IEEE/CVF International Conference on Computer Vision , 20224–20234 (2023).
Wang, Y. et al . Semantic segmentation dataset of Chinese martial arts classic movie props. ScienceDB https://doi.org/10.57760/sciencedb.07008 (2023).
Liu, Y. et al . Paddleseg: A high-efficient development toolkit for image segmentation 2101.06175 (2021).
Benenson, R., Popov, S. & Ferrari, V. Large-scale interactive object segmentation with human annotators. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , 11700–11709 (2019).
Chen, L., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Ferrari, V., Hebert, M., Sminchisescu, C. & Weiss, Y. (eds.) Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part VII , vol. 11211 of Lecture Notes in Computer Science , 833–851 https://doi.org/10.1007/978-3-030-01234-2_49 (Springer, 2018).
Long, J., Shelhamer, E. & Darrell, T. Fully convolutional networks for semantic segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7-12, 2015 , 3431–3440 https://doi.org/10.1109/CVPR.2015.7298965 (IEEE Computer Society, 2015).
Zhao, H., Shi, J., Qi, X., Wang, X. & Jia, J. Pyramid scene parsing network. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21-26, 2017 , 6230–6239, https://doi.org/10.1109/CVPR.2017.660 (IEEE Computer Society, 2017).
Xie, E. et al . Segformer: Simple and efficient design for semantic segmentation with transformers. In Ranzato, M., Beygelzimer, A., Dauphin, Y. N., Liang, P. & Vaughan, J. W. (eds.) Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual , 12077–12090, https://proceedings.neurips.cc/paper/2021/hash/64f1f27bf1b4ec22924fd0acb550c235-Abstract.html (2021).
Download references
This research is supported in part by the Zhejiang Provincial Natural Science Foundation of China (No.LTGG24F030002); the National Natural Science Foundation of China (No. 62206242); the Public Welfare Technology Application Research Project of Zhejiang Province, China (No. LGF21F010001).
These authors contributed equally: Suiyu Zhang, Rong Wang.
College of Media Engineering, Communication University of Zhejiang, Hangzhou, 310018, China
Suiyu Zhang, Rong Wang, Yaqi Wang, Xiaoyu Ma & Dingguo Yu
Key Lab of Film and TV Media Technology of Zhejiang Province, Hangzhou, 310018, China
Suiyu Zhang & Rong Wang
Department of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, 264209, China
School of Biomedical Engineering, Shenzhen University, Shenzhen, Guangdong, China
Hongyuan Zhang
School of Automation, Hangzhou Dianzi University, Hangzhou, 310018, China
You can also search for this author in PubMed Google Scholar
Suiyu Zhang is responsible for collecting film and television data, organizing related materials, dataset buliding, and funding acquisition. Rong Wang is responsible for original draft writing and graphic drawing. Yaqi Wang was responsible for funding acquisition, supervision, and data curation. Xiaoyu Ma was responsible for the technical verification of the dataset. Chengyu Wu is responsible for manuscript revision and data visualization, Hongyuan Zhang is responsible for manuscript revision and language rewriting. Zhi Li is responsible for content grammar checking. Dingguo Yu is responsible for supervision, project administration and funding acquisition.
Correspondence to Yaqi Wang or Dingguo Yu .
Competing interests.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .
Reprints and permissions
Cite this article.
Zhang, S., Wang, R., Wang, Y. et al. ChineseMPD: A Semantic Segmentation Dataset of Chinese Martial Arts Classic Movie Props. Sci Data 11 , 882 (2024). https://doi.org/10.1038/s41597-024-03701-6
Download citation
Received : 05 January 2023
Accepted : 30 July 2024
Published : 14 August 2024
DOI : https://doi.org/10.1038/s41597-024-03701-6
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.
A fundamental biological question is to understand how cell types and functions are determined by genomic and proteomic coding. A basic form of this question is to ask if small families of genes or proteins code for cell types. For example, it has been shown that the collection of homeodomain proteins can uniquely delineate all 118 neuron classes in the nematode C. elegans. However, unique characterization is neither robust nor rare. Our goal in this paper is to develop a rigorous methodology to characterize molecular codes. We show that in fact for information-theoretic reasons almost any sufficiently large collection of genes is able to disambiguate cell types, and that this property is not robust to noise. To quantify the discriminative properties of a molecular codebook in a more refined way, we develop new statistics - partition cardinality and partition entropy - borrowing ideas from coding theory. We prove these are robust to data perturbations, and then apply these in the C. elegans example and in cancer. In the worm, we show that the homeodomain transcription factor family is distinguished by coding for cell types sparsely and efficiently compared to a control of randomly selected family of genes. Furthermore, the resolution of cell type identities defined using molecular features increases as the worm embryo develops. In cancer, we perform a pan-cancer study where we use our statistics to quantify interpatient tumor heterogeneity and we identify the chromosome containing the HLA family as sparsely and efficiently coding for melanoma.
The authors have declared no competing interest.
View the discussion thread.
Thank you for your interest in spreading the word about bioRxiv.
NOTE: Your email address is requested solely to identify you as the sender of this article.
Discover the world's research
Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1209))
Included in the following conference series:
With the arrival of the era of mass tourism, tourism highway have become an important carrier for promoting the integrated development of transportation and tourism. This paper takes the Jilin section of the G331 highway as an example and conducted a questionnaire survey on 222 tourists. Using exploratory factor analysis, five common factors of tourist motivations were extracted: leisure and relaxation, return to nature, exploration and trial, socializing, and knowledge-seeking. Based on this, cluster analysis was conducted, yielding four main groups: self-improvement type, ecological nature type, novelty-seeking type, and dual-goal-seeking type. It was found that different types of tourists vary in the number of companions, residence, and transportation characteristics. These findings contribute to a deeper analysis of the tourist market for tourism highway, providing a theoretical reference for the practical management of tourism highway.
This is a preview of subscription content, log in via an institution to check access.
Subscribe and save.
Tax calculation will be finalised at checkout
Purchases are for personal use only
Institutional subscriptions
Mengmeng, W., Yaping, K., Chen Bing, G., Xiaofeng,: Concept, attributes, and classification of tourism highways. Highway 64 (03), 176–181 (2019)
Google Scholar
Jin, H.: Exploration of innovative design concepts and practices for tourism highways. Highway 63 (11), 220–223 (2018)
Smith, W.R.: Product differentiation and market segmentation as alternative marketing strategies. Market. Manage. (1995)
Yang Jianming, Y., Yaling, Y.L.: Market segmentation of tourists in Fuzhou national forest park: factor-cluster analysis based on recreational motivation. Scientia Silvae Sinicae 51 (09), 106–116 (2015)
Dann, G.M.S.: Tourism motivation: a appraisal. Ann. Tour. Res. 8 (2), 187–219 (1981)
Article Google Scholar
Guofang, L., Hongyue, L.: Travel behavior of tourists in Chengdu. J. Traffic Transp. Eng. Inf. 14 (02), 36–41 (2016)
Gang, Z., Jiaqi, Z.: Research on market segmentation of elderly tourism based on the perspective of Shi. Resour. Dev. Market 31 (12), 1540–1544 (2015)
Zhangxin, Y., Zancai, X., Yueliang, T.: Statistical testing for market segmentation in rural tourism. Stat. Decis. 34 (20), 114–117 (2018)
Yiling, L., Gennian, S.: How does a highway become a tourism resource. Highway 62 (03), 193–198 (2017)
Download references
Authors and affiliations.
Jilin Provincial Institute of Transportation Science, Changchun, 130012, China
Wentao Liu & Bing Tian
Harbin Institute of Technology, Harbin, 150090, China
Leqi Cui & Xiqiao Zhang
You can also search for this author in PubMed Google Scholar
Correspondence to Xiqiao Zhang .
Editors and affiliations.
Beijing Jiaotong University, Beijing, China
Department of Civil Engineering, Toronto Metropolitan University, Toronto, ON, Canada
Reprints and permissions
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper.
Liu, W., Cui, L., Zhang, X., Tian, B. (2024). Research on Market Segmentation of Tourism Highway Travelers Based on Travel Behavior. In: Jia, L., Easa, S., Qin, Y. (eds) Developments and Applications in SmartRail, Traffic, and Transportation Engineering. ICSTTE 2023. Lecture Notes in Electrical Engineering, vol 1209. Springer, Singapore. https://doi.org/10.1007/978-981-97-3682-9_32
DOI : https://doi.org/10.1007/978-981-97-3682-9_32
Published : 14 August 2024
Publisher Name : Springer, Singapore
Print ISBN : 978-981-97-3681-2
Online ISBN : 978-981-97-3682-9
eBook Packages : Engineering Engineering (R0)
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Policies and ethics
You might be using an unsupported or outdated browser. To get the best possible experience please use the latest version of Chrome, Firefox, Safari, or Microsoft Edge to view this website. |
Published: Aug 12, 2024, 7:15am
How network segmentation works, the main benefits of network segmentation, network segmentation vs. microsegmentation, network segmentation vs. internal segmentation, network segmentation and zero trust, bottom line, frequently asked questions (faqs).
Network segmentation refers to the process of partitioning a network into subnetworks. There are several reasons a company or organization may want to segment its network, including to improve security and performance. Read on to learn about the main benefits of network segmentation and why it’s important.
These days, networks are both physical and on the cloud. With so many possible ways to connect to a network, such as BYOD (bring your own devices) and IoT (Internet of Things) , network segmentation is an important way to establish who can access which part of a network and organize traffic flows.
Today, most segmentation is done through software-defined networking (SDN) , which is an approach to networks that allows for flexible network configuration and meets the needs of cloud computing as opposed to traditional physical networks.
Featured Partners
Monday Service
On Monday Service's Website
AI-powered automations and workflows, customized self-service
On Freshdesk's Website
$25 per user per month
On Salesforce's Website
On Zendesk's Website
The main benefits of network segmentation are to improve network security and performance. Improving network security will protect a company in the long term, while improving performance can both increase business value and improve user experience.
Network segmentation is an important part of network security. “Companies typically segment their networks based on the sensitivity and security requirements of their systems,” says Josh Amishav-Zlatin , founder and CEO of the data breach monitoring company Breachsense.
Segmentation gives control over who can access which segments of the network. It is also important because it means that if a network breach occurs, the hacker would not have access to the whole network but only the network segment. “Networks without segmentation have a larger attack surface, enabling attackers to exploit vulnerabilities anywhere on the network to escalate privileges and gain unauthorized access to critical systems or sensitive data,” Amishav-Zlatin says.
“For example, systems that touch PII, customer records or sensitive financial or personal details should all be on their own subnet. In the event of a security incident, this will help contain the attack,” says Amishav-Zlatin.
A flat network also makes it difficult to recover from the damage of an attack. “In the event of an attack, the lack of segmentation will hinder incident response efforts to isolate infected systems and prevent malicious users from pivoting to other machines on the network,” Amishav-Zlatin says.
“Segmenting a network can help a company improve its bandwidth and performance,” says Dr. Chris Mattmann , Chief Technology and Innovation Officer (CTIO) at NASA Jet Propulsion Laboratory. Different employees of a company require different bandwidth and internet speeds.
For example, the HR team may not need the latest and fastest internet connection, while the data science team may require more bandwidth to run tests. Additionally, a company might want to ensure that the essential services of a network receive the fastest speeds and highest bandwidth. The process of partitioning a network by requirement and priority is a form of segmentation.
“You can’t do segmentation and partitioning without network analytics, which is understanding how people are using them, what the business requirements are for access and more,” Mattmann says. Insight into how networks are being used now and how they need to be used in the future for business purposes will help an organization make informed choices about network segmentation.
For example, a particular project may be, “such a critical portion of our value delivery stream in our business that it needs a dedicated superfast network. So we might partition it for business reasons,” Mattmann says.
Given the implications of network segmentation on speed and performance, it is important not only for improved access but also for business value.
“Microsegmentation” takes the concept of network segmentation one step further. While network segmentation focuses on dividing the network into multiple smaller parts, microsegmentation puts each application in its own zone.
“The advantage of the latter approach is that it gives admins the ability to enforce access controls on the application layer, monitor communication patterns and detect and respond to security threats more effectively,” says Amishav-Zlatin.
Internal segmentation is a subset of network segmentation. “Network segmentation refers to dividing the entire network into subnets, while internal segmentation refers to dividing an organization’s internal network into smaller segments,” Amishav-Zlatin says.
Zero trust is an approach to cybersecurity based on the idea that no user or asset should be trusted by default. One of the concepts of zero trust is the principle of least privilege, which means users should only have access to parts of the network they need and nothing more. This is important from a cybersecurity perspective because granting all users access to all parts of the network means that an internal threat actor would be able to harm the entire network, and an external hacker would be able to attack a larger surface area once inside the network.
Network segmentation can help establish a zero-trust network, as segmentation is a necessary part of limiting user access. To prevent an attack, for example, important information such as credit card numbers and personal identifiable information (PII) should be on their own subnetworks. Amishav-Zlatin also recommends using, “microsegmentation to enforce zero trust and the least-privilege policy for even finer control on how applications are allowed to interact with each other.”
Network segmentation divides a computer network into subnetworks. The main reasons for network segmentation are enhanced performance and improved security. Every business should implement network segmentation, as it’s an important way to establish network security, limit the risk and potential damage of a cyberattack and improve the overall network performance.
A financial company, for example, has a lot of sensitive information that it doesn’t everyone in the company to have access to. Network segmentation can ensure that only the people who have privileged access can see sensitive documents, while other employees cannot.
The purpose of network segmentation is to increase security and improve performance. When a network has segments, the surface area is broken into parts, which limit the access an internal or external threat actor can have. Network performance can also improve through network segmentation by controlling which traffic gets directed to which parts of the network.
Every company or organization can benefit from network segmentation. Implementing some form of network segmentation will help prevent both internal and external security breaches.
Leeron is a New York-based writer with experience covering technology and politics. Her work has appeared in publications such as Quartz, the Village Voice, Gothamist, and Slate.
Grab your spot at the free arXiv Accessibility Forum
Help | Advanced Search
Title: ensemble architecture in polyp segmentation.
Abstract: In this research, we revisit the architecture of semantic segmentation and evaluate the models excelling in polyp segmentation. We introduce an integrated framework that harnesses the advantages of different models to attain an optimal outcome. More specifically, we fuse the learned features from convolutional and transformer models for prediction, and we view this approach as an ensemble technique to enhance model performance. Our experiments on polyp segmentation reveal that the proposed architecture surpasses other top models, exhibiting improved learning capacity and resilience. The code is available at this https URL .
Subjects: | Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG) |
Cite as: | [cs.CV] |
(or [cs.CV] for this version) | |
Focus to learn more arXiv-issued DOI via DataCite |
Access paper:.
Code, data and media associated with this article, recommenders and search tools.
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs .
COMMENTS
The majority of image segmentation algorithms may be. divided into three techniques: boundary-based segmentation, region-based segmentation, and hybrid -based segmentation. [1 3 ]. The first ...
Image segmentation is the most important part of image processing, separating an image into multiple meaningful parts. In this area of the image segmentation, new technologies emerge day after day. In this paper, an in-depth analysis is carried out on some frequently adopted image segmentation techniques such as thresholding based techniques, edge detection based or boundary based techniques ...
In addition, a non-parametric analysis was conducted to assess the relationship between article focus and CABS journal ranking. Based on Spearman's correlation, research on pure B2B market segmentation is significantly less likely (r: −0.270, n = 88, p < 0.05) to appear in top-tier publication outlets (i.e., JM, JMR, MS, and JAMS) than was research focusing on both B2B and B2C settings.
Image segmentation, which has become a research hotspot in the field of image processing and computer vision, refers to the process of dividing an image into meaningful and non-overlapping regions, and it is an essential step in natural scene understanding. Despite decades of effort and many achievements, there are still challenges in feature extraction and model design. In this paper, we ...
There are several. techniques of image segmentation like thresholding. method, region based method, edge based method, clustering methods and the watershed method etc. In this paper we will see ...
There are two types of segmentation commonly used in tourism: a priori geographical segmentation and a posteriori behavioral segmentation (Dolnicar and Leisch, 2005). ... This paper explores the ...
Segmentation has been a rooted area of research having diverse dimensions. The roots of image segmentation and its associated techniques have supported computer vision, pattern recognition, image processing, and it holds variegated applications in crucial domains. To compile the vast literature on machine learning and deep learning-based segmentation techniques and proffer statistical ...
The technology of image segmentation is widely used in medical image processing, face recognition pedestrian detection, etc. The current image segmentation techniques include region-based segmentation, edge detection segmentation, segmentation based on clustering, segmentation based on weakly-supervised learning in CNN, etc. This paper analyzes and summarizes these algorithms of image ...
Image segmentation is a key topic in image processing and computer vision with applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, and image compression, among many others. Various algorithms for image segmentation have been developed in the literature. Recently, due to the success of deep learning models in a wide range ...
Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer vision applications, providing key information for the global understanding of an image. This survey is an effort to summarize two decades of research in the field of SiS, where we propose a literature review of solutions starting from early historical methods followed by an overview of more recent deep ...
Layer-Based Segmentation Methods Layered model: for object detection and image segmentation that composites the output of a bank of object detectors in order to define shape masks and explain the appearance, depth ordering, and that evaluates both class and instance segmentation [10, 21]. This type didn't discuss in this paper.
Image segmentation is the task of associating pixels in an image with their respective object class labels. It has a wide range of applications in many industries including healthcare, transportation, robotics, fashion, home improvement, and tourism. Many deep learning-based approaches have been developed for image-level object recognition and pixel-level scene understanding-with the latter ...
75. Paper. Code. **Semantic Segmentation** is a computer vision task in which the goal is to categorize each pixel in an image into a class or object. The goal is to produce a dense pixel-wise segmentation map of an image, where each pixel is assigned to a specific class or object. Some example benchmarks for this task are Cityscapes, PASCAL ...
Image segmentation is a prerequisite for image processing. There are many methods for image segmentation, and as a result, a great number of methods for evaluating segmentation results have also been proposed. How to effectively evaluate the quality of image segmentation is very important. In this paper, the existing image segmentation quality evaluation methods are summarized, mainly ...
4. A segmentation base is identified as the distinguished base and a model is developed for predicting this base from other (possibly external) variables. The present paper highlights the definition and major basis of market segmentation. This research paper is broadly divided in to four parts.
In addition, as can be shown in Figure 1 and Figure 2, deep learning-based brain tumor segmentation and classification techniques are becoming the most active research area. In this paper, a comprehensive survey on region growing, shallow machine learning, and deep learning-based brain tumor segmentation and classification methods are presented.
Based on this paper corpus, we identified a four-phase process consisting of information (data) collection, customer representation, customer analysis via segmentation and customer targeting. With respect to customer representation and customer analysis by segmentation, we provide a comprehensive overview of the methods used in these process steps.
In sum, this chapter explains the three stages of target marketing, including; market segmentation (ii) market targeting and (iii) market positioning. Discover the world's research 25+ million members
This type of segmentation provides a deeper understanding of the desires, needs, and decision-making considerations of a potential user of a product or service. Applied correctly, it could enhance efficacy of public health initiatives, ensure new products reach the people most likely to need and use them, and increase the likelihood of lasting ...
Image Segmentation Using Deep Learning: A Survey. Plaza, Nasser Kehtarnavaz, and Demetri TerzopoulosAbstract—Image segmentation is a key topic in image processing and computer vision with applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented. reality, and image compression, among ...
Semantic segmentation and scene understanding and have become important research directions in the field of computer vision 1,2,3,4,5,6, with applications covering a wide range of areas including ...
07054 ; Phone: (973) 658 1719; Fax: (973) 658 1701; Email: [email protected]. Bruce Cooil acknowledges support from the Dean's Fund for Faculty Research, Owen Graduate School, Vanderbilt ...
Semantic image segmentation, which becomes one of the key applications in image pro-cessing and computer vision domain, has been used in multiple domains such as medical area and intelligent transportation. Lots of benchmark datasets are released for researchers to verify their algorithms. Semantic segmentation has been studied for many years.
The Journal of Field Robotics is an applied robotics journal publishing theoretical and practical papers on robotics used in real-world applications. Abstract In ground mobile robots, effective path planning relies on their ability to assess the types and conditions of the surrounding terrains. ... This research was supported by the National ...
In the field of point cloud segmentation research, deep learning methods have been highly favored by researchers. There are mainly three methods: projection-based, voxel-based, and point-based methods, each with its own advantages and disadvantages. The following is a detailed introduction to these methods. 2.1 Projection-based method
A fundamental biological question is to understand how cell types and functions are determined by genomic and proteomic coding. A basic form of this question is to ask if small families of genes or proteins code for cell types. For example, it has been shown that the collection of homeodomain proteins can uniquely delineate all 118 neuron classes in the nematode C. elegans. However, unique ...
In 2005, Christensen, Cook and Hall, in the Harvard Business Review, found that of 30,000 new products launched in the US, 85% failed because of poor market segmentation. Yankelovich's paper in ...
3.1 Basic Sample Characteristics Analysis. A descriptive statistical analysis of the survey sample was carried out. The results show that male tourists accounted for the majority of the total number of people, accounting for 61.50%; the majority of tourists were middle-aged and young people, with 49.50% of the tourists aged 18-35, 32.50% of the tourists aged 36-60, and a lower percentage ...
Segmentation gives control over who can access which segments of the network. It is also important because it means that if a network breach occurs, the hacker would not have access to the whole ...
In this research, we revisit the architecture of semantic segmentation and evaluate the models excelling in polyp segmentation. We introduce an integrated framework that harnesses the advantages of different models to attain an optimal outcome. More specifically, we fuse the learned features from convolutional and transformer models for prediction, and we view this approach as an ensemble ...