
Claude 3.5-level performance at 86% lower cost: LocAgent, a new open-source tool for code localization

2025-06-01 18:58:12
Source: 猫眼影戏

Author: 范稚莲

Reported by 胡梅尔斯, 猫眼影戏

Here is another piece of research for programmers to celebrate. A team from OpenHands, Yale, USC, and Stanford has just released LocAgent, a graph-indexed LLM agent framework built specifically for code localization, which pushes code localization accuracy to a new high of 92.7%. The work has been accepted to ACL 2025.

Paper title: LocAgent: Graph-Guided LLM Agents for Code Localization
Paper link: https://arxiv.org/abs/2503.09089
Code link: https://github.com/gersteinlab/LocAgent

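1. A real pain point: how hard is code localization?

Every programmer has probably been here: you read a bug report and wonder, "Where exactly do I need to change the code?" Traditional approaches rely on keyword matching (too coarse), hand the entire codebase to an LLM (too inefficient), or let an agent blindly walk the directory tree (too clumsy).

The core of the problem is that the issue as described in natural language and the code that actually needs fixing are often several layers of calls apart. For example, a user may report an "XSS vulnerability", while the code that needs to change is a validation utility function buried deep in the call chain.

In other words, code localization means pinpointing, within a large codebase, the exact code locations that need to be modified. Accurate localization is key to development efficiency in software development and maintenance (Figure 1 shows four common code-repair scenarios).

Figure 1: Given a codebase (left) and an issue description (middle, with examples of the four scenarios), code localization must identify the relevant code locations that need modification (right), including specific files, classes, and functions. LocAgent aims to let an LLM agent complete this process automatically.

Issue descriptions written in natural language (such as bug reports) often differ markedly from the true root cause, both semantically and in structural distance (see Figure 2). The model therefore has to understand the natural-language report in depth and also reason and trace across hierarchical structures and complex dependencies in a large codebase.

Figure 2: Red nodes are functions explicitly mentioned in the issue description; yellow nodes are the functions that actually need to be modified (patched). Task difficulty is defined as the length of the shortest path (minimum number of hops) in the code graph from a mentioned function to a target patched function; in the example shown, the difficulty is 2 hops.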

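2. LocAgent: giving the LLM a "code map"

The team's solution is rather clever. First, they parse the entire codebase into a graph that captures the contain, invoke, inherit, and import relations among files, classes, and functions. Then they give the LLM agent a small, unified set of graph primitives for exploring the codebase efficiently. Because the codebase is represented as a heterogeneous graph, the large language model can "move" through the code as if it were using a map, perform multi-hop reasoning, and close in on the target code step by step.

Figure 3: Overview of the LocAgent framework

As Figure 3 shows, LocAgent first parses the codebase into a heterogeneous graph representation containing several types of code entities and their dependencies. On top of this graph, the system builds hierarchical sparse indexes that support efficient content retrieval and structured exploration. With these indexes, LocAgent combines the graph structure with its tool interface to run an agent-driven, step-by-step search that completes the code localization task precisely.

2.1 Building the code representation

Code graph construction: to represent the structural and semantic information of a codebase in a unified way, LocAgent parses the codebase using abstract syntax trees (ASTs) and builds a heterogeneous directed graph that serves as a structured index. The graph records the contain, invoke, import, and inherit relations among directories, files, classes, and functions, making implicit dependencies explicit so that the LLM can reason over them efficiently.

The advantage of this graph structure is that two code fragments become "neighbors" on the graph whenever an invoke or inherit relation links them, even if they live in different modules. Earlier directory-navigation methods would treat modules two subdirectories apart as unrelated. In LocAgent's graph, if a function in module A calls module B, then A and B are connected directly by an invoke edge and sit close together. This proximity is crucial for code localization, because many issues are not confined to a single folder but span several modules through a call chain.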
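To make the graph construction concrete, here is a minimal sketch using Python's standard `ast` module. It is not LocAgent's implementation: the entity ID format (`path:Class.method`), the edge labels, and the name-based resolution of calls and imports are simplifying assumptions chosen for illustration.

```python
import ast
from collections import defaultdict

def build_code_graph(files):
    """files maps a file path to Python source text (hypothetical input format).
    Returns (nodes, edges): nodes maps entity id -> type, edges is a set of
    (source_id, relation, target_id) triples."""
    nodes, edges = {}, set()
    defs_by_name = defaultdict(list)  # bare name -> ids of entities defining it
    pending = []                      # (source_id, relation, bare target name)

    def visit(node, parent_id):
        for child in ast.iter_child_nodes(node):
            if isinstance(child, (ast.ClassDef, ast.FunctionDef, ast.AsyncFunctionDef)):
                kind = "class" if isinstance(child, ast.ClassDef) else "function"
                sep = ":" if nodes[parent_id] == "file" else "."
                ent_id = f"{parent_id}{sep}{child.name}"
                nodes[ent_id] = kind
                edges.add((parent_id, "contains", ent_id))
                defs_by_name[child.name].append(ent_id)
                if kind == "class":
                    for base in child.bases:
                        if isinstance(base, ast.Name):
                            pending.append((ent_id, "inherits", base.id))
                visit(child, ent_id)
            else:
                if isinstance(child, ast.Call) and isinstance(child.func, ast.Name):
                    pending.append((parent_id, "invokes", child.func.id))
                elif isinstance(child, ast.ImportFrom):
                    for alias in child.names:
                        pending.append((parent_id, "imports", alias.name))
                visit(child, parent_id)

    for path, source in files.items():
        nodes[path] = "file"
        visit(ast.parse(source), path)

    # Resolve names only after every file is parsed, so cross-file edges work.
    # Assumption: a bare name links to every entity that defines that name.
    for src, rel, name in pending:
        for target in defs_by_name.get(name, []):
            edges.add((src, rel, target))
    return nodes, edges

files = {
    "pkg/a/views.py": (
        "from pkg.b.utils import sanitize\n"
        "def render(text):\n"
        "    return sanitize(text)\n"
    ),
    "pkg/b/utils.py": (
        "class Base:\n"
        "    pass\n"
        "class Escaper(Base):\n"
        "    def run(self, s):\n"
        "        return s\n"
        "def sanitize(s):\n"
        "    return Escaper().run(s)\n"
    ),
}
nodes, edges = build_code_graph(files)
for edge in sorted(edges):
    print(edge)
```

In this toy example `render` and `sanitize` live in different directories, yet the graph joins them with a single `invokes` edge, which is the kind of proximity the paragraph above describes. A real system would need proper scope and import resolution rather than matching on bare names.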

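2.2 A tool interface for the agent to query

Once the code graph is built, LocAgent exposes a unified tool interface that lets the LLM agent conveniently access both the graph structure and the code content. It consists of three main APIs:

SearchEntity: this tool searches the codebase for relevant entities by keyword, using a hierarchical entity index. When the upper-level index yields no match, the system automatically falls back to the next index level, moving from exact matching to fuzzy search to find the closest matches. For each retrieved entity, SearchEntity returns a summary of the code snippet (see Figure 4) at one of three levels of detail: fold, preview, or full code, which can be expanded as needed.

Figure 4: Examples of the different output formats designed for efficient agent-code interaction.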
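The exact-to-fuzzy fallback in SearchEntity can be sketched as a sequence of tiers, where a later tier is consulted only if the earlier ones return nothing. The tiers below (exact ID, exact name, substring, then `difflib` similarity), the function name `search_entity`, and the `cutoff` parameter are assumptions for illustration, not LocAgent's actual index layers.

```python
import difflib

def search_entity(keyword, entity_ids, cutoff=0.6):
    """Return (tier, matches) from the first tier that produces any match.
    Entity ids are assumed to look like 'path/file.py:Class.method'."""
    key = keyword.lower()
    # Short name of each entity: the last component after ':' and '.'.
    names = {eid: eid.split(":")[-1].split(".")[-1].lower() for eid in entity_ids}

    exact_id = [eid for eid in entity_ids if eid.lower() == key]
    if exact_id:
        return "exact_id", exact_id

    exact_name = [eid for eid, name in names.items() if name == key]
    if exact_name:
        return "exact_name", exact_name

    substring = [eid for eid, name in names.items() if key in name]
    if substring:
        return "substring", substring

    # Last resort: approximate string similarity on the short names.
    close = set(difflib.get_close_matches(key, list(names.values()), n=5, cutoff=cutoff))
    return "fuzzy", [eid for eid, name in names.items() if name in close]

entity_ids = [
    "src/auth.py:validate_token",
    "src/html.py:Escaper.escape_html",
    "src/html.py:sanitize_input",
]
print(search_entity("sanitize_input", entity_ids))
print(search_entity("escape", entity_ids))
print(search_entity("sanitise_inputs", entity_ids))
```

Stopping at the first tier that matches keeps results precise when the issue text names an entity exactly, and only widens the search when it has to.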

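RetrieveEntity: once the agent decides that a code entity is likely the target, it can use this tool to fetch the entity's complete information. Given an entity ID, RetrieveEntity returns detailed attributes such as the file path, start and end line numbers, and the full code content.

TraverseGraph: this tool performs a type-aware breadth-first search on the code graph. The agent specifies the starting entity ID along with parameters such as traversal direction, number of steps (hops), entity types, and relation types. Starting from that entity, the tool walks the graph for the requested number of hops and returns the subgraph it visited. By setting different type filters, the agent can flexibly explore the graph, for example "follow invoke relations downstream for two hops" or "view the inheritance hierarchy starting from a class". Notably, TraverseGraph formats the returned subgraph as tree-structured text (see Figure 5) so that the LLM can more easily understand the relation topology.

Figure 5: Example output of the TraverseGraph tool.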
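A type-aware breadth-first traversal with tree-style output might look like the sketch below. The parameter names mirror the ones described above (start entity, direction, hops, entity types, relation types), but the signature, the edge representation, and the text layout are assumptions rather than LocAgent's actual API.

```python
from collections import deque

def traverse_graph(edges, node_types, start, hops=1, direction="downstream",
                   relations=None, entity_types=None):
    """Breadth-first walk from `start`, at most `hops` edges deep.
    Returns a list of (depth, relation, entity_id, parent_id) discoveries."""
    adjacency = {}
    for src, rel, dst in edges:
        if relations is not None and rel not in relations:
            continue  # relation-type filter
        if direction == "upstream":
            src, dst = dst, src  # follow edges backwards
        adjacency.setdefault(src, []).append((rel, dst))

    seen = {start}
    queue = deque([(start, 0)])
    found = []
    while queue:
        node, depth = queue.popleft()
        if depth == hops:
            continue
        for rel, nxt in adjacency.get(node, []):
            if nxt in seen:
                continue
            # Entity-type filter; note that it also prunes paths through nxt.
            if entity_types is not None and node_types.get(nxt) not in entity_types:
                continue
            seen.add(nxt)
            found.append((depth + 1, rel, nxt, node))
            queue.append((nxt, depth + 1))
    return found

def format_tree(start, found):
    """Render the discoveries as indented text, one line per entity."""
    children = {}
    for _depth, rel, node, parent in found:
        children.setdefault(parent, []).append((rel, node))
    lines = [start]

    def render(node, indent):
        for rel, child in children.get(node, []):
            lines.append(f"{'  ' * indent}{rel} -> {child}")
            render(child, indent + 1)

    render(start, 1)
    return "\n".join(lines)

edges = [
    ("views.py", "contains", "views.py:render"),
    ("views.py:render", "invokes", "utils.py:sanitize"),
    ("utils.py:sanitize", "invokes", "utils.py:Escaper"),
    ("utils.py:Escaper", "inherits", "utils.py:Base"),
]
node_types = {
    "views.py": "file", "views.py:render": "function",
    "utils.py:sanitize": "function", "utils.py:Escaper": "class",
    "utils.py:Base": "class",
}
found = traverse_graph(edges, node_types, "views.py:render", hops=2, relations={"invokes"})
print(format_tree("views.py:render", found))
```

The call at the end corresponds to "follow invoke relations downstream for two hops"; passing `direction="upstream"` instead would list callers rather than callees.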

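2.3 The agent-driven reasoning stage

In its prompt design, LocAgent adopts a step-by-step Chain-of-Thought (CoT) strategy that guides the LLM agent to break the code localization task into a series of steps, mimicking how a human debugs and closing in on the target one step at a time. The overall problem-solving process can be summarized in the following stages:

Issue understanding and keyword extraction: the agent first analyzes the input issue description, separates it into its different aspects, and extracts keywords related to the problem. These keywords set the initial direction for the subsequent search.

Linking keywords to code entities: for each extracted keyword, the agent calls the SearchEntity tool to look up matching code entities in the code index.

Multi-hop reasoning to build a fault chain: next, the agent tries to connect the clues and work from the observed error back to its cause. It first identifies the initial entry points that trigger the problem (for example, the API or function that raises the error), then explores the code graph iteratively from those points: it calls TraverseGraph to search along invoke or dependency relations in the relevant direction; uses RetrieveEntity to inspect the implementation details of key nodes; and, when necessary, calls SearchEntity again with new keywords. By alternating these tools over several rounds, the agent gradually builds a logical path from the symptom of the problem to its potential root cause.

Locking onto the target code: once it has a full understanding of the problem, the agent uses the suspicious links exposed in the fault chain to locate every code entity that may need modification (possibly several functions or classes). It then outputs these candidate entities ranked by relevance, together with their file paths and a brief explanation of the reasoning.

For the user, LocAgent is very simple to use: provide an issue description in natural language, and the LLM agent autonomously carries out the search, traversal, and read operations described above, then returns the code localization result.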

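3. Experimental results: impressive numbers

LocAgent's performance on real-world datasets, and the accompanying analysis, is striking. The study uses an existing benchmark (SWE-Bench Lite) together with the team's newly built Loc-Bench, and compares the code localization quality of a range of baseline methods.

(1) Strong code localization performance

SWE-Bench Lite is a repository-level code-repair dataset built from GitHub issues and is also commonly used to evaluate code localization. It contains 300 issues with their corresponding fixes, most of which are bug reports. On this benchmark, LocAgent achieves the best code localization accuracy reported so far, significantly outperforming existing methods.

It improves markedly on traditional retrieval-based methods: BM25 reaches only 61.7% file-level Acc@5, and even an advanced code embedding model such as CodeRankEmbed reaches only 84.7%, whereas LocAgent reaches 92.7% and is likewise clearly ahead at function-level localization.

Agent-based methods with multi-step reasoning outperform fixed-pipeline methods overall. Fixed-pipeline methods (such as Agentless) can often find only a limited set of candidates based on literal matching, while step-by-step agent exploration can consider a wider scope and localizes better.

At all three granularities (file, module, and function), LocAgent surpasses existing agent systems based on GPT-4o or Claude-3.5. With Claude-3.5, LocAgent reaches 94% file-level Acc@5 on SWE-Bench Lite and also outperforms other methods at function-level localization.

LocAgent with a fine-tuned Qwen2.5-32B model performs almost on par with Claude-3.5: on SWE-Bench Lite file-level top-5 accuracy, the former scores 92.7% and the latter about 94.2%, a small gap. With the smaller fine-tuned Qwen2.5-7B model, accuracy drops slightly (to about 88.3%, still above the vast majority of baselines), and its performance already approaches that of GPT-4o.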
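The article quotes Acc@5 and Acc@15 without defining the metric. The sketch below assumes one common definition for localization benchmarks, namely that an instance counts as correct only when every ground-truth location appears among the top k predictions; this definition and the toy data are assumptions, not taken from the text.

```python
def acc_at_k(predictions, gold, k):
    """predictions: one ranked list of locations per issue.
    gold: one set of ground-truth locations per issue.
    An issue is a hit only if all of its gold locations are in the top k."""
    hits = sum(1 for ranked, truth in zip(predictions, gold)
               if truth <= set(ranked[:k]))
    return hits / len(gold)

# Toy data (hypothetical file names, not benchmark results).
predictions = [
    ["a.py", "b.py", "c.py"],
    ["x.py", "y.py"],
    ["m.py", "n.py", "o.py"],
]
gold = [{"b.py"}, {"y.py", "z.py"}, {"o.py", "m.py"}]

print(acc_at_k(predictions, gold, k=1))
print(acc_at_k(predictions, gold, k=3))
```

Under this strict reading, the second toy issue is never a hit because `z.py` is missing from its ranked list, no matter how large k is.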

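(2) Generalization across task types

Because SWE-Bench Lite is heavily skewed toward bug reports, the team built a new benchmark, Loc-Bench, to evaluate localization across diverse software maintenance tasks more thoroughly. Loc-Bench contains 560 real GitHub issues covering four categories: bug fixes, feature additions, security vulnerabilities, and performance optimizations. The task mix is more balanced and closer to real engineering scenarios.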

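4. An open-source bonus: small models can compete too

The most exciting part of this research is that open-source models, once fine-tuned, can match commercial large models. The team provides two versions: a fine-tuned Qwen2.5-7B, which rivals GPT-4o at a per-run cost of only $0.05, and a fine-tuned Qwen2.5-32B, which approaches Claude-3.5 while cutting cost by 86%. For companies that need large-scale deployment, this is a powerful way to reduce cost and raise efficiency.

Specifically, with the fine-tuned Qwen2.5-7B model, LocAgent achieves an average file-level Acc@5 of 76.8% and a function-level Acc@15 of 46.9% across the four Loc-Bench categories, close to SWE-Agent paired with Claude-3.5 (about 45.4% at function level). Combining LocAgent with Claude-3.5 raises the average file-level accuracy to 81.1%, surpassing other methods on almost all four task types.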

Î塢ʵ¼ÊÓ¦Ó㺲»¿ÉÊǶ¨Î» £¬»¹ÄÜÖúÁ¦½â¾öÎÊÌâ

Ñо¿ÍŶÓÑéÖ¤ÁËÒ»¸öÒªº¦µã£º¸ü׼ȷµÄ´úÂ붨λֱ½ÓÌáÉýÎÊÌâ½â¾öÂÊ¡£ÔÚ GitHub ÎÊÌâ×Ô¶¯ÐÞ¸´ÈÎÎñÖÐ £¬Ê¹Óà LocAgent µÄ Pass@10 ÀÖ³ÉÂʱȻùÏßÒªÁìÌáÉýÁË 12%¡£ÕâÒâζ×ÅÕâÏî¼¼Êõ²»µ«½öÊǸö¡¸¶¨Î»¹¤¾ß¡¹ £¬¶øÊÇÄÜʵʵÔÚÔÚÌáÉýÕû¸öÈí¼þά»¤Á÷³ÌЧÂʵÄÀûÆ÷¡£

¸ÃÍŶӽøÒ»²½´Ó²î±ð½Ç¶ÈÕ¹¿ªÆÊÎö £¬Ì½ÌÖÆäÔÚÅÓ´óÈÎÎñÖеÄÎȶ¨ÐÔ¡¢±¾Ç®Ð§ÂÊ¡¢Òªº¦×é¼þ×÷ÓÃÒÔ¼°¶ÔÏÂÓÎÓ¦ÓõÄʵ¼Ê¼ÛÖµ¡£

£¨1£©ÄѶȷּ¶ÊµÑéÓë¶àÌøÂ³°ôÐÔ

ΪÁËÉîÈëÁ˽â LocAgent µÄÄÜÁ¦ £¬¸ÃÍŶӻ¹Æ¾¾ÝÈÎÎñµÄÄѶȶÔÐÔÄܽøÐÐÁËÆÊÎö¡£¸ÃÍŶӽ«¡¸ÄѶȡ¹ÓôúÂëͼÉϺ¯Êý¾àÀ루hop Êý£©À´È¨ºâ£º¼´ Issue ÃèÊöÖÐÌá¼°µÄº¯ÊýÓëʵ¼ÊÐèÒªÐ޸ĵĺ¯ÊýÖ®¼äµÄ×î¶Ì·¾¶¡£Ö±¹ÛµØËµ £¬hop=0 ÌåÏÖ Issue Ö±½ÓÌáµ½ÁËÐèÒª¸ÄµÄº¯ÊýÃû £»hop=1 ÌåÏÖÄ¿±êº¯ÊýÊÇ Issue ÖÐÌáµ½µÄº¯ÊýÖ®¼äÓÐÖ±½Ó¹ØÏµ £¬hop ÊýÔ½´óÔò¶¨Î»ÄѶÈÔ½¸ß¡£
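The hop-count difficulty measure is a shortest-path computation, which a multi-source breadth-first search handles directly. The sketch below treats graph edges as undirected when measuring distance; that choice, the function name `hop_distance`, and the toy graph are assumptions for illustration.

```python
from collections import deque

def hop_distance(edges, mentioned, targets):
    """Minimum number of hops from any mentioned function to any target
    function, or None if no target is reachable."""
    neighbors = {}
    for src, _rel, dst in edges:
        # Assumption: distance ignores edge direction.
        neighbors.setdefault(src, set()).add(dst)
        neighbors.setdefault(dst, set()).add(src)

    targets = set(targets)
    seen = set(mentioned)
    queue = deque((m, 0) for m in mentioned)
    while queue:
        node, dist = queue.popleft()
        if node in targets:
            return dist
        for nxt in neighbors.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None

# Hypothetical call graph: the issue mentions handle_request,
# but the patch belongs in validate_input.
edges = [
    ("handle_request", "invokes", "render_form"),
    ("render_form", "invokes", "validate_input"),
    ("unrelated_a", "invokes", "unrelated_b"),
]
print(hop_distance(edges, ["handle_request"], ["validate_input"]))
print(hop_distance(edges, ["validate_input"], ["validate_input"]))
print(hop_distance(edges, ["handle_request"], ["unrelated_b"]))
```

The three calls illustrate a two-hop case, the hop=0 case where the issue names the patched function, and an unreachable target.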

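The experiments show that localization accuracy drops for every method as the hop count increases. After all, the less direct the connection, the longer the chain the model has to reason through. The methods differ clearly in robustness, however: agent-based methods degrade far less at high difficulty than retrieval-based methods. LocAgent in particular, thanks to its graph-structured index, maintains relatively high accuracy as the hop count grows.

By contrast, traditional retrieval methods almost fail once two or more hops are required. In function-level localization they sometimes miss the target even when the target function's name appears in the query, because they tend to treat the query as a whole and cannot break it down to handle the details.

(2) Effectiveness versus cost

With its structured graph index and tool calls, LocAgent needs only 6 to 9 rounds of interaction to complete a code localization task, so the reasoning process is efficient. In addition, the team obtained results comparable to commercial large models using open-source models while sharply reducing inference cost, which makes real-world deployment feasible.

In concrete terms, with a commercial API model such as Claude-3.5, the average cost per issue is about $0.66; with a locally deployed Qwen2.5-32B model, the cost falls to about $0.09, a reduction of 86%. With the smaller 7B model, the cost can be as low as $0.05 while performance remains better than most methods. Measured by the ratio of function-level accuracy to cost, the fine-tuned Qwen-2.5-7B is the most cost-effective option and is more efficient than all commercial models; Qwen-2.5-32B comes next and is also far ahead of Claude-3.5. This shows that, combined with the LocAgent framework, open-source models are not only competitive in performance but also more economical to deploy.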
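The 86% figure follows directly from the quoted per-issue costs. The short check below uses only the dollar amounts stated above; the helper name `cost_reduction` is introduced here for illustration.

```python
# Average per-issue costs in USD, as quoted in the text above.
COST_PER_ISSUE = {
    "Claude-3.5": 0.66,
    "Qwen2.5-32B (fine-tuned)": 0.09,
    "Qwen2.5-7B (fine-tuned)": 0.05,
}

def cost_reduction(baseline, cost):
    """Fraction of the baseline cost that is saved."""
    return (baseline - cost) / baseline

baseline = COST_PER_ISSUE["Claude-3.5"]
for model, cost in COST_PER_ISSUE.items():
    saved = cost_reduction(baseline, cost)
    print(f"{model}: ${cost:.2f} per issue, {saved:.1%} cheaper than Claude-3.5")
```

For the 32B model this gives (0.66 - 0.09) / 0.66, roughly 0.86, which matches the reported 86%. By the same arithmetic the 7B model works out to a saving of roughly 92%, a number derived here rather than quoted in the article.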

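(3) Application impact: high-quality localization significantly raises the resolution rate

To assess the effect of code localization on real software maintenance tasks, the team further analyzed how LocAgent performs in automatically resolving GitHub issues. The results show that the issue resolution success rate rises significantly as localization accuracy improves, indicating that more precise localization substantially improves the quality and stability of automated code modification. This finding confirms that LocAgent not only performs well at localization itself but also improves overall performance on downstream tasks, giving it practical engineering value.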

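6. Technical takeaway: structured indexing + intelligent reasoning

LocAgent's success points to an important trend: a paradigm shift from "brute-force computation" to "intelligent decision-making". Traditional methods either hand the entire codebase to an LLM for brute-force matching or have an agent blindly traverse directories according to preset rules; both are "compute-intensive" solutions. LocAgent instead uses structured intermediate representations such as a graph index to decompose a complex problem structurally, and then lets the LLM take on higher-level reasoning and decision-making.

The core of this "agentic retrieval" paradigm is intelligent decision-making. Structured intermediate representations such as graphs and trees make information easier to reason about, so the agent can adjust its search strategy dynamically to the problem at hand instead of rigidly following a preset path. This represents a shift from "hand-designing all kinds of RAG pipelines" to "letting the AI decide how to retrieve".

This paradigm of co-designing structured indexes with LLM agents may well become a standard pattern for future AI engineering applications. The point is no longer to make the LLM do more computation but to make it take smarter decisions. Programmers' debugging experience is in for another major upgrade!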
