Journal of Command and Control

fa بهبود جستجتوی یادگیری تقویتی عمیق با بهینه سازی مستعمره مورچه Improving search for deep reinforcement learning with ant colony optimization مهندسی کامپیوتر Computer Engineering پژوهشي Research ظهور اکوسیستم عظیم اینترنت اشیا<a href="#_ftn1" name="_ftnref1" title="">[1]</a> (IoT) در حال تغییر سبک زندگی انسان است. اینترنت اشیا هنوز هم متکی به کمک های انسانی است و زمان پاسخ دهی غیرقابل قبول برای بررسی داده های بزرگ دارد و همچنان با چالش های قابل توجهی روبرو هستند. بنابراین، ایجاد چارچوب و الگوریتم جدید برای حل مشکلات خاص اینترنت اشیا سریع، بسیار ضروری است. رویکردهای یادگیری تقویتی و یادگیری تقویتی عمیق<a href="#_ftn2" name="_ftnref2" title="">[2]</a> (DRL) توانایی تصمیم گیری را دارند، اما روشهای مدلسازی و آموزش سنتی، وقت گیر بوده و کاربردهای آنها را محدود می کنند. این مقاله برای غلبه بر این معضل، روش یادگیری تقویتی متناسب با اینترنت اشیا را پیشنهاد می کند. به این صورت که یک روش انتخاب ویژگی مبتنی بر بهینه سازی مستعمره مورچه<a href="#_ftn3" name="_ftnref3" title="">[3]</a> (ACO) پیشنهاد می کنیم. از آنجا که توابع اکتشافی بر روند تصمیم گیری ACO در طی فرآیند جستجو تأثیر می گذارد، استفاده از روش یادگیری ابتکاری می تواند به الگوریتم کمک کند تا در فضای جستجو بهتر جستجو کند. سرانجام، به عنوان مطالعه موردی اینترنت اشیا، روش پیشنهادی برای کنترل چراغ راهنمایی، با هدف کاهش ازدحام ترافیک در تقاطع های شهرهای هوشمند، اعمال می شود. نتایج تجربی نشان می دهد که روش پیشنهادی می تواند در مقایسه با رویکردهای سنتی، اقدامات بهتری را در زمان اجرای کوتاهتر بیاموزد. <div>  <hr> <div id="ftn1"><a href="#_ftnref1" name="_ftn1" title="">[1]</a> Internet of Things</div> <div id="ftn2"><a href="#_ftnref2" name="_ftn2" title="">[2]</a> Deep Reinforcement Learning</div> <div id="ftn3"><a href="#_ftnref3" name="_ftn3" title="">[3]</a> Ant Colony Optimization</div> </div> The emergence of the massive Internet of Things (IoT) ecosystem is changing human lifestyles. The Internet of Things still relies on human assistance and has unacceptable response times for analyzing big data, and still faces significant challenges. Therefore, it is very necessary to create a new framework and algorithm to solve the specific problems of fast Internet of Things. Reinforcement learning and deep reinforcement learning (DRL) approaches have the ability to make decisions, but traditional modeling and training methods are time-consuming and limit their applications. To overcome this problem, this article proposes a reinforcement learning method suitable for the Internet of Things. In this way, we propose a feature selection method based on ant colony optimization (ACO). Since the heuristic functions affect the decision-making process of ACO during the search process, the use of heuristic learning method can help the algorithm to search the search space better. Finally, as a case study of IoT, the proposed method is applied to traffic light control, with the aim of reducing traffic congestion in smart city intersections. Experimental results show that the proposed method can learn better actions in a shorter execution time compared to traditional approaches. 1 11 http://ic4i-journal.ir/browse.php?a_code=A-10-319-2&slc_lang=fa&sid=1 Mohammad Hassan Nataj Solhdar محمد حسن نتاج صلحدار nataj.solhdar@gmail.com 10031947532846002434 10031947532846002434 Yes دانشگاه شهید چمران اهواز_پردیس صنعتی شهدای هویزه