FTR拍卖中市场参与者的强化学习算法

2007 IEEE Lausanne Power Tech Pub Date : 2007-07-01 DOI:10.1109/PCT.2007.4538442

N. P. Ziogos, A. C. Tellidou, V. P. Gountis, A. Bakirtzis

{"title":"FTR拍卖中市场参与者的强化学习算法","authors":"N. P. Ziogos, A. C. Tellidou, V. P. Gountis, A. Bakirtzis","doi":"10.1109/PCT.2007.4538442","DOIUrl":null,"url":null,"abstract":"This paper presents a Q-Learning algorithm for the development of bidding strategies for market participants in FTR auctions. Each market participant is represented by an autonomous adaptive agent capable of developing its own bidding behavior based on a Q-learning algorithm. Initially, a bi- level optimization problem is formulated. At the first level, a market participant tries to maximize his expected profit under the constraint that, at the second level, an independent system operator tries to maximize the revenues from the FTR auction. It is assumed that each FTR market participant chooses his bidding strategy, for holding a FTR, based on a probabilistic estimate of the LMP differences between withdrawal and injection points. The market participant expected profit is calculated and a Q- learning algorithm is employed to find the optimal bidding strategy. A two-bus and a five-bus test system are used to illustrate the presented method.","PeriodicalId":356805,"journal":{"name":"2007 IEEE Lausanne Power Tech","volume":"275 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"A Reinforcement Learning Algorithm for Market Participants in FTR Auctions\",\"authors\":\"N. P. Ziogos, A. C. Tellidou, V. P. Gountis, A. Bakirtzis\",\"doi\":\"10.1109/PCT.2007.4538442\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a Q-Learning algorithm for the development of bidding strategies for market participants in FTR auctions. Each market participant is represented by an autonomous adaptive agent capable of developing its own bidding behavior based on a Q-learning algorithm. Initially, a bi- level optimization problem is formulated. At the first level, a market participant tries to maximize his expected profit under the constraint that, at the second level, an independent system operator tries to maximize the revenues from the FTR auction. It is assumed that each FTR market participant chooses his bidding strategy, for holding a FTR, based on a probabilistic estimate of the LMP differences between withdrawal and injection points. The market participant expected profit is calculated and a Q- learning algorithm is employed to find the optimal bidding strategy. A two-bus and a five-bus test system are used to illustrate the presented method.\",\"PeriodicalId\":356805,\"journal\":{\"name\":\"2007 IEEE Lausanne Power Tech\",\"volume\":\"275 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 IEEE Lausanne Power Tech\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PCT.2007.4538442\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE Lausanne Power Tech","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCT.2007.4538442","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

本文提出了一种q -学习算法，用于制定FTR拍卖中市场参与者的竞价策略。每个市场参与者都由一个自主自适应代理代表，该代理能够基于q学习算法开发自己的投标行为。首先，提出了一个双层优化问题。在第一级，市场参与者试图最大化其预期利润，而在第二级，独立系统运营商试图最大化FTR拍卖的收入。假设每个FTR市场参与者根据对退出点和注入点之间LMP差异的概率估计来选择其持有FTR的投标策略。计算市场参与者的期望利润，并采用Q学习算法寻找最优竞价策略。以双母线和五母线测试系统为例说明了该方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Reinforcement Learning Algorithm for Market Participants in FTR Auctions

This paper presents a Q-Learning algorithm for the development of bidding strategies for market participants in FTR auctions. Each market participant is represented by an autonomous adaptive agent capable of developing its own bidding behavior based on a Q-learning algorithm. Initially, a bi- level optimization problem is formulated. At the first level, a market participant tries to maximize his expected profit under the constraint that, at the second level, an independent system operator tries to maximize the revenues from the FTR auction. It is assumed that each FTR market participant chooses his bidding strategy, for holding a FTR, based on a probabilistic estimate of the LMP differences between withdrawal and injection points. The market participant expected profit is calculated and a Q- learning algorithm is employed to find the optimal bidding strategy. A two-bus and a five-bus test system are used to illustrate the presented method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2007 IEEE Lausanne Power Tech

自引率

0.00%

发文量