All Insights
#GenerativeAI
June 2026

Sunday Coffee & Code: Last week the RFP Marketplace PoC worked. This weeks was all about improving outputs (prompt improvements and testing Gemma4)

Last weekโ€™s dry run proved the PoC worked. A buyer could issue an RFP, a supplier could upload their company knowledge, the responder agent could generate a grounded draft response, and the assessor agent could evaluate that response against weighted criteria with rationale and evidence.

By Steve Harris

Last weekโ€™s dry run proved the PoC worked. A buyer could issue an RFP, a supplier could upload their company knowledge, the responder agent could generate a grounded draft response, and the assessor agent could evaluate that response against weighted criteria with rationale and evidence.

๐—ฆ๐—ผ ๐˜๐—ต๐—ถ๐˜€ ๐˜„๐—ฒ๐—ฒ๐—ธ๐—ฒ๐—ป๐—ฑ ๐—œ ๐—ณ๐—ผ๐—ฐ๐˜‚๐˜€๐—ฒ๐—ฑ ๐—ผ๐—ป ๐—ฟ๐—ฒ๐˜€๐—ฝ๐—ผ๐—ป๐˜€๐—ฒ ๐—พ๐˜‚๐—ฎ๐—น๐—ถ๐˜๐˜†.

๐˜›๐˜ฉ๐˜ฆ ๐˜ง๐˜ช๐˜ณ๐˜ด๐˜ต ๐˜ต๐˜ฉ๐˜ช๐˜ฏ๐˜จ ๐˜ ๐˜ค๐˜ฉ๐˜ข๐˜ฏ๐˜จ๐˜ฆ๐˜ฅ ๐˜ธ๐˜ข๐˜ด ๐˜ต๐˜ฉ๐˜ฆ ๐˜ฑ๐˜ณ๐˜ฐ๐˜ฎ๐˜ฑ๐˜ต. The original prompt was a useful starting point, but it was too broad. It produced something plausible, but not something I would want to hand to a proposal team as a first draft. I reworked it to make the responder agent pay much closer attention to the specific RFP requirement, use the supporting RFP information properly, and produce something closer to a proposal section than a general capability statement (๐˜ข๐˜ญ๐˜ญ ๐˜ฃ๐˜ข๐˜ด๐˜ฆ๐˜ฅ ๐˜ฐ๐˜ฏ ๐˜ฎ๐˜บ ๐˜ด๐˜ต๐˜ข๐˜ฏ๐˜ฅ๐˜ข๐˜ณ๐˜ฅ ๐˜ฑ๐˜ณ๐˜ฐ๐˜ฎ๐˜ฑ๐˜ต๐˜ช๐˜ฏ๐˜จ ๐˜ต๐˜ฆ๐˜ฎ๐˜ฑ๐˜ญ๐˜ข๐˜ต๐˜ฆ).

That alone helped. The improved prompt produced a clearer and more focused response in about 52 seconds.

๐˜›๐˜ฉ๐˜ฆ๐˜ฏ ๐˜ ๐˜ธ๐˜ข๐˜ฏ๐˜ต๐˜ฆ๐˜ฅ ๐˜ต๐˜ฐ ๐˜ต๐˜ฆ๐˜ด๐˜ต ๐˜ต๐˜ฉ๐˜ฆ ๐˜ฆ๐˜ง๐˜ง๐˜ฆ๐˜ค๐˜ต ๐˜ฐ๐˜ง ๐˜ข ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ ๐˜ค๐˜ฉ๐˜ข๐˜ฏ๐˜จ๐˜ฆ ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ. I updated Ollama and moved from llama3.1:8b to Google gemma4:26b.

The full sample prompt is over 36,000 characters, not that big. The gemma4 model just spun. A simple Harry Potter prompt worked fine, so the model was not completely broken. Testing via ๐˜ฐ๐˜ญ๐˜ญ๐˜ข๐˜ฎ๐˜ข ๐˜ณ๐˜ถ๐˜ฏ, it just dropped back to the prompt. Tested a smaller prompt which worked and Iฬฒ ฬฒdฬฒoฬฒnฬฒโ€™ฬฒtฬฒ ฬฒkฬฒnฬฒoฬฒwฬฒ ฬฒwฬฒhฬฒyฬฒ ฬฒIฬฒ ฬฒdฬฒiฬฒdฬฒnฬฒโ€™ฬฒtฬฒ ฬฒcฬฒlฬฒuฬฒeฬฒ ฬฒiฬฒnฬฒ ฬฒoฬฒnฬฒ ฬฒtฬฒhฬฒiฬฒsฬฒ ฬฒ-ฬฒ ฬฒIฬฒ ฬฒhฬฒaฬฒvฬฒeฬฒ ฬฒcฬฒoฬฒmฬฒeฬฒ ฬฒaฬฒcฬฒrฬฒoฬฒsฬฒsฬฒ ฬฒtฬฒhฬฒeฬฒ ฬฒpฬฒrฬฒoฬฒbฬฒlฬฒeฬฒmฬฒ ฬฒbฬฒeฬฒfฬฒoฬฒrฬฒeฬฒ.ฬฒ

So I watched the Ollama logs, ๐˜ซ๐˜ฐ๐˜ถ๐˜ณ๐˜ฏ๐˜ข๐˜ญ๐˜ค๐˜ต๐˜ญ -๐˜ง -๐˜ถ ๐˜ฐ๐˜ญ๐˜ญ๐˜ข๐˜ฎ๐˜ข and there it was, ๐˜ต๐˜ณ๐˜ถ๐˜ฏ๐˜ค๐˜ข๐˜ต๐˜ช๐˜ฏ๐˜จ ๐˜ช๐˜ฏ๐˜ฑ๐˜ถ๐˜ต ๐˜ฑ๐˜ณ๐˜ฐ๐˜ฎ๐˜ฑ๐˜ต ๐˜ญ๐˜ช๐˜ฎ๐˜ช๐˜ต=4096 - the ๐˜ฏ๐˜ถ๐˜ฎ_๐˜ค๐˜ต๐˜น problem. Quick fix to the API code, a re-test and all was fine.

The response from the gemma4:26b was much better, followed the requirement more closely, used the RFP context more effectively, and read much more like something a human could refine rather than rewrite. (I also like how the Gemma4 model shows itโ€™s reasoning - something I have captured in the past as a form of ๐˜ณ๐˜ฆ๐˜ข๐˜ด๐˜ฐ๐˜ฏ๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ถ๐˜ฅ๐˜ช๐˜ต ๐˜ต๐˜ณ๐˜ข๐˜ช๐˜ญ).

Still a POC. Still rough around the edges but ๐˜‚๐˜€๐—ถ๐—ป๐—ด ๐—ฎ ๐—ฐ๐—ผ๐—บ๐—ฝ๐—น๐—ฒ๐˜๐—ฒ๐—น๐˜† ๐—ผ๐—ณ๐—ณ๐—น๐—ถ๐—ป๐—ฒ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ, ๐—ฎ ๐—ฐ๐—ผ๐—ฟ๐—ฟ๐—ฒ๐—ฐ๐˜๐—น๐˜† ๐—ฐ๐—ผ๐—ป๐—ณ๐—ถ๐—ด๐˜‚๐—ฟ๐—ฒ๐—ฑ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น, ๐—ฎ๐—ฟ๐—ผ๐˜‚๐—ป๐—ฑ ๐—ฎ๐—ป ๐—ฅ๐—™๐—ฃ ๐—บ๐—ฎ๐—ฟ๐—ธ๐—ฒ๐˜๐—ฝ๐—น๐—ฎ๐—ฐ๐—ฒ ๐˜๐—ผ ๐—ฏ๐—ฟ๐—ถ๐—ป๐—ด ๐—ฏ๐˜‚๐˜†๐—ฒ๐—ฟ๐˜€ ๐—ฎ๐—ป๐—ฑ ๐˜€๐—ฒ๐—น๐—น๐—ฒ๐—ฟ๐˜€ ๐˜๐—ผ๐—ด๐—ฒ๐˜๐—ต๐—ฒ๐—ฟ ๐˜๐—ผ ๐—ฟ๐—ฒ๐—ฑ๐˜‚๐—ฐ๐—ฒ ๐—ณ๐—ฟ๐—ถ๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—ฎ๐—ป๐—ฑ ๐—ผ๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜€๐—ฒ ๐˜๐—ต๐—ฒ ๐—ฒ๐—ป๐—ฑ ๐˜๐—ผ ๐—ฒ๐—ป๐—ฑ ๐—ฝ๐—ฟ๐—ผ๐—ฐ๐—ฒ๐˜€๐˜€ ๐—น๐—ผ๐—ผ๐—ธ๐˜€ ๐˜๐—ผ ๐—บ๐—ฒ ๐˜๐—ผ ๐—ฏ๐—ฒ ๐—ฎ ๐—ฟ๐—ฒ๐—ฎ๐—น๐—ถ๐˜๐˜†.

Want to Discuss This Topic?

Steve is always happy to have a direct conversation.