国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

基于兩階分區的MapReduce實(shí)驗室系統負載均衡研究
DOI:
CSTR:
作者:
作者單位:

1.深圳市檢驗檢疫科學(xué)研究院;2.深圳市檢驗檢疫科學(xué)研究院深圳

作者簡(jiǎn)介:

通訊作者:

中圖分類(lèi)號:

TP301.6????

基金項目:

國家重點(diǎn)研發(fā)計劃課題(2019YFC1605401);海關(guān)總署課題(2020HK109)。


Research on load balancing of MapReduce laboratory system based on two-tier partition
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 圖/表
  • |
  • 訪(fǎng)問(wèn)統計
  • |
  • 參考文獻
  • |
  • 相似文獻
  • |
  • 引證文獻
  • |
  • 資源附件
  • |
  • 文章評論
    摘要:

    在實(shí)驗室系統處理海量原始數據時(shí),實(shí)際應用場(chǎng)景中存在采樣率高、偏度(skewness)高的特殊情況,導致在使用兩階分區算法在平衡同構環(huán)境下的Reducer節點(diǎn)負載時(shí),無(wú)法有效地處理這些問(wèn)題。為此,引入MapReduce的并行化處理,可以提高實(shí)驗室系統中采樣數據利用率;同時(shí),為了解決數據偏度和采樣度高的問(wèn)題,則采用了ICSC(Improved Cluster Split Combination)分區調度的算法。經(jīng)過(guò)實(shí)驗證明,基于兩階分區的MapReduce負載均衡算法能夠有效減少Mapper和Reducer節點(diǎn)空轉的時(shí)間。隨著(zhù)數據偏度的增加,算法的執行時(shí)長(cháng)基本不產(chǎn)生變化,即數據偏度對該算法執行時(shí)間的影響較小。此外,數據采樣度的增加,ICSC分區調度算法也保持著(zhù)對比模型中最少的時(shí)間開(kāi)銷(xiāo)。因此,基于兩階分區的MapReduce負載均衡算法弱化了Reducer節點(diǎn)間的依賴(lài)性,并提升MapReduce任務(wù)的執行效率和容錯率,從而高效地實(shí)現MapReduce框架下的實(shí)驗室系統中數據處理的負載均衡。

    Abstract:

    When processing raw data in a laboratory system, there are special cases of high sampling rate and high skewness in real-world application scenarios, which cannot be effectively dealt with when balancing the load on the Reducer nodes in a homogeneous environment using a two-order partitioning algorithm. Therefore, the parallel processing of MapReduce is introduced to improve the utilization of sampling data in the laboratory system; At the same time, in order to solve the problem of data skewness and high sampling, ICSC (Improved Cluster Split Combination) partition scheduling algorithm is adopted. Experiments show that MapReduce load balancing algorithm based on two-tier partition can effectively reduce the idle time of Mapper and Reducer nodes. With the increase of data skewness, the execution time of the algorithm is basically unchanged, that is, data skewness has little impact on the execution time of the algorithm. In addition, with the increase of data sampling, ICSC partition scheduling algorithm also maintains the minimum time cost in the comparison model. Therefore, the MapReduce load balancing algorithm based on two-tier partitions weakens the dependency between the reducer nodes, and improves the execution efficiency and fault tolerance of MapReduce tasks, thus effectively realizing the load balancing of data processing in the laboratory system under the MapReduce framework.

    參考文獻
    相似文獻
    引證文獻
引用本文

鄭文麗,熊貝貝,程立勛,蔡伊娜,包先雨.基于兩階分區的MapReduce實(shí)驗室系統負載均衡研究計算機測量與控制[J].,2023,31(4):252-257.

復制
分享
文章指標
  • 點(diǎn)擊次數:
  • 下載次數:
  • HTML閱讀次數:
  • 引用次數:
歷史
  • 收稿日期:2022-11-11
  • 最后修改日期:2022-12-19
  • 錄用日期:2023-01-03
  • 在線(xiàn)發(fā)布日期: 2023-04-24
  • 出版日期:
文章二維碼
渭源县| 万州区| 宜州市| 繁昌县| 昆明市| 彭州市| 肃北| 张家口市| 阳朔县| 柳州市| 墨玉县| 长兴县| 罗江县| 高淳县| 富平县| 武夷山市| 湾仔区| 芜湖市| 湘西| 大厂| 盘锦市| 淮南市| 咸丰县| 西城区| 赫章县| 延庆县| 丹巴县| 定州市| 万源市| 镇巴县| 南开区| 东宁县| 罗定市| 屯昌县| 喀什市| 江西省| 怀远县| 保山市| 鄂伦春自治旗| 宣城市| 武义县|