N. P. Ziogos, A. C. Tellidou, V. P. Gountis, A. Bakirtzis
2007 IEEE Lausanne Power Tech. Published 2007-07-01. DOI: 10.1109/PCT.2007.4538442 (https://doi.org/10.1109/PCT.2007.4538442)
A Reinforcement Learning Algorithm for Market Participants in FTR Auctions
This paper presents a Q-learning algorithm for developing the bidding strategies of market participants in FTR auctions. Each market participant is represented by an autonomous adaptive agent that develops its own bidding behavior through Q-learning. First, a bilevel optimization problem is formulated. At the first level, a market participant tries to maximize his expected profit, subject to the constraint that, at the second level, an independent system operator tries to maximize the revenues from the FTR auction. It is assumed that each FTR market participant chooses his bidding strategy for holding an FTR based on a probabilistic estimate of the LMP differences between the withdrawal and injection points. The market participant's expected profit is calculated, and a Q-learning algorithm is employed to find the optimal bidding strategy. A two-bus and a five-bus test system are used to illustrate the presented method.
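To make the learning step concrete, the following is a minimal sketch of a stateless Q-learning bidder, not the paper's model. Everything specific in it is an assumption made for illustration: the second-level ISO auction is replaced by a toy rule in which a single 1 MW FTR goes to the agent only if its bid exceeds one fixed rival bid, the winner pays its own bid, the LMP difference is drawn from a normal distribution, and the step size is the sample-average rate 1/n. The names `train_bidder`, `bid_prices`, and `rival_bid` are hypothetical. With a single state, the usual update reduces to Q(a) ← Q(a) + α (r − Q(a)).

```python
import random


def train_bidder(bid_prices, lmp_diff_mean, lmp_diff_std, rival_bid,
                 episodes=5000, epsilon=0.1, seed=0):
    """Stateless Q-learning over a discrete set of bid prices (toy auction).

    Assumptions (not taken from the paper): one 1 MW FTR, one fixed rival
    bid, pay-as-bid settlement, normally distributed LMP difference.
    Returns the greedy bid price and the learned Q-values.
    """
    rng = random.Random(seed)
    q = [0.0] * len(bid_prices)      # Q-value per candidate bid price
    visits = [0] * len(bid_prices)   # visit counts for the 1/n step size

    for _ in range(episodes):
        # Epsilon-greedy action selection: explore with probability epsilon.
        if rng.random() < epsilon:
            a = rng.randrange(len(bid_prices))
        else:
            a = max(range(len(q)), key=q.__getitem__)

        bid = bid_prices[a]
        if bid > rival_bid:
            # The FTR pays the realized LMP difference; the agent pays its bid.
            lmp_diff = rng.gauss(lmp_diff_mean, lmp_diff_std)
            reward = lmp_diff - bid
        else:
            # The bid does not clear, so there is no payoff and no payment.
            reward = 0.0

        # Single-state Q-learning update with step size alpha = 1/n.
        visits[a] += 1
        q[a] += (reward - q[a]) / visits[a]

    best = max(range(len(q)), key=q.__getitem__)
    return bid_prices[best], q


if __name__ == "__main__":
    best_bid, q_values = train_bidder(
        bid_prices=[2.0, 4.0, 6.0, 8.0, 10.0, 12.0],
        lmp_diff_mean=10.0, lmp_diff_std=1.0, rival_bid=6.0)
    print("greedy bid:", best_bid)
    print("Q-values:", [round(v, 2) for v in q_values])
```

In this toy setting the agent should settle on the lowest bid that still clears, because higher bids win the same FTR at a higher cost and lower bids never clear. A model closer to the abstract would replace the fixed rival bid with an ISO auction clearing step that accounts for other participants and network constraints.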