
Approaching Claude 3.5 at 86% lower cost: LocAgent, a new open-source tool for code localization, is here

2025-06-04 16:50:59

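Here is another piece of research for programmers to celebrate! A research team from OpenHands, Yale, USC, and Stanford has just released LocAgent, a graph-indexed LLM agent framework built specifically for code localization, which pushes code localization accuracy to a new high of 92.7%. The work has been accepted to ACL 2025.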

Paper title: LocAgent: Graph-Guided LLM Agents for Code Localization

Paper link: https://arxiv.org/abs/2503.09089

Code link: https://github.com/gersteinlab/LocAgent

1. The pain point is real: just how hard is code localization?

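Every programmer has probably been through this: you read a bug report and wonder, baffled, "where exactly do I need to change this?" Traditional approaches either rely on keyword matching (too coarse), hand the entire codebase to an LLM (too inefficient), or let an agent blindly traverse directories (too clumsy).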

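The core of the problem is that the issue described in natural language and the code location that actually needs fixing are often separated by several layers of call relationships. For example, a user reports an "XSS vulnerability", but the code that actually needs to change may be a deeply nested validation utility function.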

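In other words, code localization means precisely finding the code locations that need modification within a large codebase. In software development and maintenance, localizing code problems accurately is key to developer productivity (Figure 1 shows four common code-repair scenarios).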

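Figure 1: Given a codebase (left) and an issue description (middle, with examples of four scenarios), code localization must identify the relevant code locations that need modification (right), including specific files, classes, and functions. LocAgent aims to have an LLM agent complete this process automatically.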

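Issue descriptions in natural language (such as bug reports) often differ markedly, in both semantics and structural distance, from the actual root cause of the failure (see Figure 2). This requires the model not only to understand the natural-language bug report in depth, but also to reason and trace across hierarchical structures and complex dependency relationships in a large codebase.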

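Figure 2: Red nodes denote functions explicitly mentioned in the issue description; yellow nodes denote functions that actually need to be modified (patched). Task difficulty is defined as the length of the shortest path (minimum number of hops) in the code graph from a mentioned function to the target patched function; in the illustrated example the task difficulty is 2 hops.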
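The hop-based difficulty measure can be illustrated with a plain breadth-first search. The sketch below is only an illustration, not the authors' code: the function name `hop_distance`, the toy graph, and the choice to follow edges in the stored direction are all assumptions made for the example.

```python
from collections import deque


def hop_distance(graph, mentioned, targets):
    """Return the minimum number of edges from any mentioned node to any
    target node, or None when no target is reachable."""
    targets = set(targets)
    seen = set(mentioned)
    queue = deque((node, 0) for node in mentioned)
    while queue:
        node, dist = queue.popleft()
        if node in targets:
            return dist
        for neighbor in graph.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    return None


# Hypothetical call graph: the issue mentions render_page, while the
# function that needs patching is sanitize_input.
code_graph = {
    "render_page": ["build_form"],
    "build_form": ["sanitize_input"],
    "sanitize_input": [],
}
difficulty = hop_distance(code_graph, ["render_page"], ["sanitize_input"])
```

In this toy graph the target sits two edges away from the mentioned function, matching the 2-hop case in the figure. To treat relationships as undirected, add the reverse edges to the adjacency map before calling the function.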

2. LocAgent: giving the LLM a "code map"

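The team's solution is rather clever. First, they parse the entire codebase into a graph that captures the contain, invoke, inherit, and import relationships among files, classes, and functions. Then they give the LLM agent a concise, unified interface of graph primitives to support efficient exploration of the codebase. By parsing the codebase into a heterogeneous graph representation, the method lets the LLM "move" through code as efficiently as reading a map, perform multi-hop reasoning, and approach the target code step by step.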

Figure 3: Overview of the LocAgent framework

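As Figure 3 shows, LocAgent first parses the codebase into a heterogeneous graph representation that contains multiple types of code entities and their dependencies. On top of this, the system builds hierarchical sparse indexes to support efficient content retrieval and structured exploration. With these indexes, LocAgent combines the graph structure with tool interfaces to run an agent-driven, step-by-step search and complete the code localization task precisely.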

2.1 Building the code representation

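Building the code graph representation: to represent the structural and semantic information in a codebase uniformly, LocAgent parses the codebase based on abstract syntax trees (ASTs) and builds a heterogeneous directed graph as a structured index. The graph records in detail the contain, invoke, import, and inherit relationships among directories, files, classes, and functions, making implicit dependencies explicit and easier for the LLM to reason over.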

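The advantage of this graph structure is that even if two code fragments live in different modules, they become "neighbors" on the graph as long as an invoke or inherit relationship exists between them. For example, earlier directory-navigation methods would treat modules separated by two subdirectories as unrelated, but if a function in module A invokes module B, then in LocAgent's graph A and B are directly connected by an invoke edge and are therefore close in the graph structure. For code localization this kind of "proximity" is crucial, because many problems are not confined to a single folder but span multiple modules through call chains.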
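To make the idea of an AST-derived heterogeneous graph concrete, here is a minimal sketch using Python's standard `ast` module. It is an assumption-laden illustration rather than LocAgent's actual parser: the function name `build_code_graph`, the edge labels, and the node naming scheme are invented for this example, and callee and base-class names are left unresolved.

```python
import ast


def build_code_graph(source: str, module: str) -> list[tuple[str, str, str]]:
    """Return (source, relation, destination) edges for one Python module."""
    edges = []

    def visit(node: ast.AST, owner: str) -> None:
        for child in ast.iter_child_nodes(node):
            if isinstance(child, ast.Import):
                edges.extend((owner, "imports", alias.name) for alias in child.names)
            elif isinstance(child, ast.ImportFrom) and child.module:
                edges.append((owner, "imports", child.module))
            elif isinstance(child, ast.ClassDef):
                name = f"{owner}.{child.name}"
                edges.append((owner, "contains", name))
                for base in child.bases:
                    if isinstance(base, ast.Name):
                        edges.append((name, "inherits", base.id))
                visit(child, name)
            elif isinstance(child, (ast.FunctionDef, ast.AsyncFunctionDef)):
                name = f"{owner}.{child.name}"
                edges.append((owner, "contains", name))
                visit(child, name)
            else:
                # Only plain-name calls such as escape(x) are recorded;
                # attribute calls such as obj.method() are skipped here.
                if isinstance(child, ast.Call) and isinstance(child.func, ast.Name):
                    edges.append((owner, "invokes", child.func.id))
                visit(child, owner)

    visit(ast.parse(source), module)
    return edges


# Hypothetical module used only to exercise the sketch.
sample = '''
import os

class Base:
    pass

class Validator(Base):
    def check(self, value):
        return escape(value)

def escape(text):
    return text.replace("<", "&lt;")
'''
edges = build_code_graph(sample, "utils")
```

A production indexer would also resolve names across files so that an invoke edge points at the defining entity; this sketch stops at raw names to stay short.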

2.2 Tool interfaces for the agent to query

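Once the code graph is built, LocAgent provides a unified tool interface so that the LLM agent can conveniently access both the graph structure and the code content. It consists mainly of the following three APIs: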

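SearchEntity: this tool searches the codebase for relevant entities by keyword, based on a hierarchical entity index. When no match is found in an upper-level index, the system automatically falls back to the next-level index, moving from exact matching to fuzzy search to find the closest match. For each retrieved entity, SearchEntity returns a summary of the code snippet (see Figure 4; there are three levels, folded, preview, and full code, which can be expanded as needed).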

Figure 4: Examples of the different output formats designed for efficient agent-code interaction.

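RetrieveEntity: once the agent has decided that a code entity is very likely the target, it can use this tool to fetch the entity's complete information. Given an entity ID, RetrieveEntity outputs detailed attributes such as the entity's file path, start and end line numbers, and full code content.

TraverseGraph: this tool performs a type-aware breadth-first search on the code graph. The agent can specify the starting entity ID along with parameters such as traversal direction, number of steps (hops), entity types, and relation types. Starting from that node, the tool walks the requested number of hops and returns the traversed subgraph. By setting different type filters, the agent can flexibly explore, for example, "follow invoke relations downstream for two hops" or "inspect the inheritance hierarchy starting from a given class". Notably, TraverseGraph formats the returned subgraph as tree-structured text (see Figure 5) so that the LLM can more easily understand the relationship topology.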

Figure 5: Example output of the TraverseGraph tool.
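The behaviour of two of these tools can be approximated in a few lines. The following is a rough sketch under stated assumptions, not LocAgent's real API: the lowercase function names, their parameters, the toy `NODES` and `EDGES` data, the use of `difflib` for the fuzzy fallback, and the tree text layout are all hypothetical.

```python
from collections import deque
from difflib import get_close_matches

# Hypothetical toy graph: node id -> entity type, plus typed directed edges.
NODES = {
    "app/views.py": "file",
    "app/views.py:render_page": "function",
    "app/forms.py:build_form": "function",
    "app/utils.py:sanitize_input": "function",
}
EDGES = [
    ("app/views.py", "contains", "app/views.py:render_page"),
    ("app/views.py:render_page", "invokes", "app/forms.py:build_form"),
    ("app/forms.py:build_form", "invokes", "app/utils.py:sanitize_input"),
]


def search_entity(keyword):
    """Try an exact name match first, then fall back to fuzzy matching."""
    names = {node_id.split(":")[-1].split("/")[-1]: node_id for node_id in NODES}
    if keyword in names:
        return [names[keyword]]
    close = get_close_matches(keyword, list(names), n=3, cutoff=0.6)
    return [names[name] for name in close]


def traverse_graph(start, direction="downstream", hops=2,
                   edge_types=None, node_types=None):
    """Type-aware BFS; returns a parent -> [(relation, child)] mapping."""
    children, seen = {}, {start}
    queue = deque([(start, 0)])
    while queue:
        node, depth = queue.popleft()
        if depth == hops:
            continue
        for src, rel, dst in EDGES:
            if direction == "upstream":
                src, dst = dst, src  # walk edges backwards
            if src != node or dst in seen:
                continue
            if edge_types and rel not in edge_types:
                continue
            if node_types and NODES[dst] not in node_types:
                continue
            seen.add(dst)
            children.setdefault(node, []).append((rel, dst))
            queue.append((dst, depth + 1))
    return children


def render_tree(children, root):
    """Format the traversed subgraph as indented tree text."""
    lines = [root]

    def walk(node, depth):
        for rel, child in children.get(node, []):
            lines.append("  " * depth + f"--{rel}--> {child}")
            walk(child, depth + 1)

    walk(root, 1)
    return "\n".join(lines)


start = search_entity("render_page")[0]
subgraph = traverse_graph(start, hops=2, edge_types={"invokes"})
tree_text = render_tree(subgraph, start)
```

Rendering the subgraph as an indented tree keeps each child next to its parent, which is one plausible reason a text-only model finds this layout easier to follow than a flat edge list.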

2.3 The agent-driven reasoning stage

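LocAgent's prompt design adopts a chain-of-thought (CoT) strategy, guiding the LLM agent to decompose the code localization task into a series of steps and to close in on the target step by step, mimicking how a human debugs. The whole problem-solving process can be summarized in the following stages: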

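Problem understanding and keyword extraction: the agent first analyzes the input issue description, separates it into different aspects of information, and then extracts keywords related to the problem. These keywords give the subsequent search its initial direction.

Linking keywords to code entities: for each extracted keyword, the agent calls the SearchEntity tool to find matching code entities in the code index.

Multi-hop reasoning to build a fault chain: next, the agent tries to connect the clues and infer the cause of the failure from the reported symptoms. It first identifies the initial entry points that trigger the problem (for example, the API or function that raises the error), then iteratively explores the code graph from those points: calling TraverseGraph to search along invoke or dependency relations in relevant directions; using RetrieveEntity to inspect the implementation details of key nodes; and, when necessary, calling SearchEntity again with new keywords. By alternating these tools over multiple rounds, the agent gradually builds a logical path from the symptoms to the potential root cause.

Locking onto the target code: after forming a comprehensive understanding of the problem, the agent uses the suspicious links exposed in the "fault chain" to locate all target code entities that may need modification (possibly several functions or classes). It then ranks these candidates by relevance and outputs them together with their file paths and a brief explanation.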

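For the user, working with LocAgent is very simple: enter a natural-language issue description, and the LLM agent autonomously carries out the series of search, traversal, and read operations described above, finally returning the code localization results.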

3. Experimental results: seriously impressive

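LocAgent's performance and analysis results on real-world datasets are striking. The study uses an existing benchmark (SWE-Bench Lite) as well as the team's newly built Loc-Bench, and compares the code localization performance of a range of baseline methods.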

(1) Strong code localization performance

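SWE-Bench Lite is a repository-level code repair dataset built from GitHub issues and is also commonly used to evaluate code localization. It contains 300 issues with their corresponding fixes, most of which are bug reports. On this benchmark, LocAgent achieves state-of-the-art code localization accuracy, clearly outperforming existing methods.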

Clear gains over traditional retrieval methods: BM25 reaches only 61.7% file-level Acc@5, and even an advanced code embedding model such as CodeRankEmbed reaches only 84.7%, whereas LocAgent reaches 92.7% and is also clearly ahead of these methods at function level.

Multi-step agent methods beat fixed-pipeline methods overall. Fixed-pipeline methods (such as Agentless) can often find only a limited set of candidates based on literal matching, whereas step-by-step agent exploration can consider a wider scope and localizes better.

At file, module, and function granularity, LocAgent surpasses existing agent systems based on GPT-4o or Claude-3.5. With Claude-3.5, LocAgent reaches 94% file-level Acc@5 on SWE-Bench Lite and likewise beats other methods at function level.

LocAgent with the fine-tuned Qwen2.5-32B model performs almost on par with Claude-3.5: file-level top-5 accuracy on SWE-Bench Lite is 92.7% for the former and about 94.2% for the latter, a small gap. With the smaller fine-tuned Qwen2.5-7B model, accuracy drops slightly (to about 88.3%, still above the vast majority of baselines), and its performance already approaches that of GPT-4o.
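For readers unfamiliar with the Acc@k figures quoted above, the metric can be computed as in the sketch below. The article does not spell out the definition, so this assumes the common convention that an instance counts as correct only when every ground-truth location appears among the top k predictions; the function name and the sample data are hypothetical.

```python
def acc_at_k(predictions, gold, k):
    """Fraction of instances whose gold locations all appear in the top k.

    predictions: one ranked list of locations per instance
    gold: one collection of ground-truth locations per instance
    """
    hits = 0
    for ranked, truth in zip(predictions, gold):
        if set(truth) <= set(ranked[:k]):
            hits += 1
    return hits / len(gold)


# Made-up rankings for three issues, for illustration only.
predictions = [
    ["a.py", "b.py", "c.py"],
    ["x.py", "y.py"],
    ["m.py"],
]
gold = [["b.py"], ["z.py"], ["m.py"]]

file_acc_at_2 = acc_at_k(predictions, gold, k=2)
file_acc_at_1 = acc_at_k(predictions, gold, k=1)
```

Under this convention a larger k can only keep or raise the score, which is why function-level results are typically reported at a larger k than file-level ones.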

(2) Generalization across multiple task scenarios

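Because SWE-Bench Lite is heavily skewed toward bug-type issues, the team built a new benchmark, Loc-Bench, to evaluate localization ability across diverse software maintenance tasks. Loc-Bench contains 560 real GitHub issues covering four categories: bug fixes, feature additions, security vulnerabilities, and performance optimization. Its task types are more balanced and closer to real engineering scenarios.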

4. Open-source bonus: small models can compete too

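The most exciting part of this work is that open-source models, once fine-tuned, can match the results of commercial large models. The team provides two versions: 1. fine-tuned Qwen2.5-7B, with performance comparable to GPT-4o at a per-instance processing cost of only $0.05; 2. fine-tuned Qwen2.5-32B, close to Claude-3.5 level with 86% cost savings. For enterprises that need large-scale deployment, this is a real tool for cutting costs and improving efficiency.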

Specifically, with the fine-tuned Qwen2.5-7B model, LocAgent achieves an average file-level Acc@5 of 76.8% and a function-level Acc@15 of 46.9% across the four Loc-Bench scenario types, comparable to SWE-Agent with Claude-3.5 (about 45.4% at function level). Combining LocAgent with Claude-3.5 raises the average file-level accuracy to 81.1%, beating other methods in nearly all four task categories.

5. Practical application: not just localization, it also helps solve issues

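The team verified a key point: more accurate code localization directly raises the issue resolution rate. In automated GitHub issue repair, the Pass@10 success rate with LocAgent is 12% higher than with baseline methods. This means the technique is not merely a "localization tool" but something that tangibly improves the efficiency of the whole software maintenance workflow.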

¸ÃÍŶӽøÒ»²½´Ó²î±ð½Ç¶ÈÕ¹¿ªÆÊÎö £¬Ì½ÌÖÆäÔÚÅÓ´óÈÎÎñÖеÄÎȶ¨ÐÔ¡¢±¾Ç®Ð§ÂÊ¡¢Òªº¦×é¼þ×÷ÓÃÒÔ¼°¶ÔÏÂÓÎÓ¦ÓõÄʵ¼Ê¼ÛÖµ¡£

(1) Difficulty-graded experiments and multi-hop robustness

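To understand LocAgent's capabilities in depth, the team also analyzed performance by task difficulty. They measure "difficulty" by function distance on the code graph (the number of hops), that is, the shortest path between the functions mentioned in the issue description and the functions that actually need modification. Intuitively, hop=0 means the issue directly names the function to change; hop=1 means the target function is directly connected to a function mentioned in the issue; the larger the hop count, the harder the localization.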

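The experiments show that as the hop count increases, localization accuracy falls for all methods. After all, the less direct the connection, the longer the chain the model must reason over. However, robustness differs markedly across methods: agent-based methods degrade noticeably less at high difficulty than retrieval-based methods. LocAgent in particular, thanks to its graph-structured index, maintains relatively high accuracy as the hop count grows, showing good robustness.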

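By contrast, traditional retrieval methods nearly fail once two or more hops are required, and at function level they sometimes miss the target even when the target function's name appears in the query (because they tend to treat the query as a whole and cannot break it down to handle details).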

(2) Performance versus cost

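With the structured graph index and tool calls, LocAgent needs only 6 to 9 interaction rounds to complete a code localization task, so the reasoning process is efficient. In addition, the team used open-source models to obtain results comparable to commercial large models while substantially reducing inference cost, which makes real-world deployment feasible.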

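Concretely, with a commercial API model such as Claude-3.5, the average processing cost per issue is about $0.66; with a locally deployed Qwen2.5-32B model, the cost falls to about $0.09, an 86% reduction. Moving further to the 7B model, the cost can be as low as $0.05 while still outperforming most methods. In terms of the ratio of function-level accuracy to cost, the fine-tuned Qwen2.5-7B is the most cost-effective option and is more efficient than all commercial models; Qwen2.5-32B comes next and is also clearly better than Claude-3.5. This shows that, combined with the LocAgent framework, open-source models are not only competitive in performance but also more economical to deploy.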
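The 86% figure follows directly from the per-issue costs quoted above. The short calculation below reproduces it; the dictionary and function names are just illustrative, and the percentage for the 7B model is derived here from the quoted costs rather than stated in the article.

```python
# Approximate per-issue costs in USD, as quoted in the text above.
COST_PER_ISSUE = {
    "Claude-3.5": 0.66,
    "Qwen2.5-32B (fine-tuned)": 0.09,
    "Qwen2.5-7B (fine-tuned)": 0.05,
}


def cost_reduction_pct(baseline, alternative):
    """Percentage saved when switching from baseline to alternative."""
    return (baseline - alternative) / baseline * 100


baseline = COST_PER_ISSUE["Claude-3.5"]
savings = {
    name: round(cost_reduction_pct(baseline, cost))
    for name, cost in COST_PER_ISSUE.items()
    if name != "Claude-3.5"
}
```

With these inputs the 32B model comes out at roughly 86% cheaper than Claude-3.5 and the 7B model at roughly 92% cheaper.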

(3) Application impact: high-quality localization significantly raises the issue resolution rate

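To assess the impact of code localization on real software maintenance tasks, the team further analyzed LocAgent's effect on automatically resolving GitHub issues. The results show that as localization accuracy improves, the issue resolution success rate rises significantly, indicating that more precise localization markedly improves the quality and stability of automated code modification. This finding confirms that LocAgent not only performs well at localization itself but also effectively lifts overall performance on downstream tasks, giving it practical engineering value.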

6. Technical takeaway: structured indexing + intelligent reasoning

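LocAgent's success reveals an important trend: a paradigm shift from "brute-force computation" to "intelligent decision-making". Traditional methods either hand the whole codebase to an LLM for brute-force matching or let an agent blindly traverse directories according to preset rules; both are "compute-intensive" solutions. LocAgent instead uses structured intermediate representations such as a graph index to decompose a complex problem structurally, and then lets the LLM take on the higher-level reasoning and decision tasks.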

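The core of this "agentic retrieval" paradigm is making decisions intelligent. With structured intermediate representations such as graphs and trees, information becomes easier to reason over, and the agent can adjust its search strategy dynamically for the specific problem rather than rigidly following a preset path. This represents a shift from "manually designing various RAG pipelines" to "letting the AI decide autonomously how to retrieve".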

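This paradigm of co-designing structured indexes with LLM agents may well become a standard pattern for future AI engineering applications. The point is no longer to have the LLM do more computation but to have it make smarter decisions. Programmers' debugging experience is about to get another major upgrade!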
