Build a Fully Self-Verifying Data Operations AI Agent Using Local Hugging Face Models for Automated Planning, Execution, and Testing

By NextTech | November 14, 2025

In this tutorial, we build a self-verifying DataOps AI agent that can plan, execute, and test data operations automatically using local Hugging Face models. We design the agent with three intelligent roles: a Planner that creates an execution strategy, an Executor that writes and runs code using pandas, and a Tester that validates the results for accuracy and consistency. By running Microsoft's Phi-2 model locally in Google Colab, we keep the workflow efficient, reproducible, and privacy-preserving while demonstrating how LLMs can automate complex data-processing tasks end-to-end.

!pip install -q transformers accelerate bitsandbytes scipy
import json, pandas as pd, numpy as np, torch
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline, BitsAndBytesConfig


MODEL_NAME = "microsoft/phi-2"


class LocalLLM:
    def __init__(self, model_name=MODEL_NAME, use_8bit=False):
        print(f"Loading model: {model_name}")
        self.tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
        if self.tokenizer.pad_token is None:
            self.tokenizer.pad_token = self.tokenizer.eos_token
        model_kwargs = {"device_map": "auto", "trust_remote_code": True}
        if use_8bit and torch.cuda.is_available():
            model_kwargs["quantization_config"] = BitsAndBytesConfig(load_in_8bit=True)
        else:
            model_kwargs["torch_dtype"] = torch.float32 if not torch.cuda.is_available() else torch.float16
        self.model = AutoModelForCausalLM.from_pretrained(model_name, **model_kwargs)
        self.pipe = pipeline("text-generation", model=self.model, tokenizer=self.tokenizer,
                             max_new_tokens=512, do_sample=True, temperature=0.3, top_p=0.9,
                             pad_token_id=self.tokenizer.eos_token_id)
        print("✓ Model loaded successfully!\n")

    def generate(self, prompt, system_prompt="", temperature=0.3):
        # Phi-2 responds well to the simple "Instruct: ... Output:" format.
        if system_prompt:
            full_prompt = f"Instruct: {system_prompt}\n\n{prompt}\nOutput:"
        else:
            full_prompt = f"Instruct: {prompt}\nOutput:"
        output = self.pipe(full_prompt, temperature=temperature, do_sample=temperature > 0,
                           return_full_text=False, eos_token_id=self.tokenizer.eos_token_id)
        result = output[0]['generated_text'].strip()
        # Trim any follow-on "Instruct:" turns the model may append.
        if "Instruct:" in result:
            result = result.split("Instruct:")[0].strip()
        return result

We install the required libraries and load the Phi-2 model locally using Hugging Face Transformers. We create a LocalLLM class that initializes the tokenizer and model, supports optional 8-bit quantization, and exposes a generate method that produces text outputs. We make sure the model runs smoothly on both CPU and GPU, making it well suited for use in Colab.
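
Before wiring the wrapper into the agent, it can help to sanity-check it in isolation. The short sketch below is not part of the original tutorial; the prompt text is only an illustrative placeholder, and it simply calls generate directly and prints the raw completion.

# Minimal sketch (illustrative only): exercise the LocalLLM wrapper on its own.
llm = LocalLLM(use_8bit=False)
reply = llm.generate(
    prompt="List two advantages of running an LLM locally.",   # placeholder prompt
    system_prompt="You are a concise technical assistant.",
    temperature=0.3,
)
print(reply)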

PLANNER_PROMPT = """You are a Data Operations Planner. Create a detailed execution plan as valid JSON.

Return ONLY a JSON object (no other text) with this structure:
{"steps": ["step 1","step 2"],"expected_output":"description","validation_criteria":["criteria 1","criteria 2"]}"""


EXECUTOR_PROMPT = """You are a Data Operations Executor. Write Python code using pandas.

Requirements:
- Use pandas (imported as pd) and numpy (imported as np)
- Store the final result in a variable named 'result'
- Return ONLY Python code, no explanations or markdown"""


TESTER_PROMPT = """You are a Data Operations Tester. Verify execution results.

Return ONLY a JSON object (no other text) with this structure:
{"passed":true,"issues":["any issues found"],"recommendations":["suggestions"]}"""


class DataOpsAgent:
    def __init__(self, llm=None):
        self.llm = llm or LocalLLM()
        self.history = []

    def _extract_json(self, text):
        # Try to parse the whole response first, then fall back to the outermost braces.
        try:
            return json.loads(text)
        except:
            start, end = text.find('{'), text.rfind('}') + 1
            if start >= 0 and end > start:
                try:
                    return json.loads(text[start:end])
                except:
                    pass
        return None

We define the system prompts for the Planner, Executor, and Tester roles of our DataOps agent. We then initialize the DataOpsAgent class with its helper methods, including a JSON extraction utility for parsing structured responses. This prepares the foundation for the agent's reasoning and execution pipeline.
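
Because small models often wrap JSON in extra prose, it is worth seeing what _extract_json tolerates. The snippet below is an illustrative check rather than part of the tutorial: the fake model reply is hypothetical, and the llm instance from the earlier sketch is assumed, but the brace-matching fallback should still recover the embedded object.

# Illustrative check of the JSON-extraction fallback on a noisy reply.
agent = DataOpsAgent(llm)
noisy_reply = 'Sure, here is the plan: {"steps": ["load df", "group by product"], "expected_output": "totals"} Hope that helps!'
print(agent._extract_json(noisy_reply))
# Expected: {'steps': ['load df', 'group by product'], 'expected_output': 'totals'}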

    # (continuing the DataOpsAgent class)
    def plan(self, task, data_info):
        print("\n" + "="*60)
        print("PHASE 1: PLANNING")
        print("="*60)
        prompt = f"Task: {task}\n\nData Info:\n{data_info}\n\nCreate an execution plan as JSON with steps, expected_output, and validation_criteria."
        plan_text = self.llm.generate(prompt, PLANNER_PROMPT, temperature=0.2)
        self.history.append(("PLANNER", plan_text))
        # Fall back to a single-step plan if the model's JSON cannot be parsed.
        plan = self._extract_json(plan_text) or {"steps": [task], "expected_output": "Processed data",
                                                 "validation_criteria": ["Result generated", "No errors"]}
        print(f"\n📋 Plan Created:")
        print(f"  Steps: {len(plan.get('steps', []))}")
        for i, step in enumerate(plan.get('steps', []), 1):
            print(f"    {i}. {step}")
        print(f"  Expected: {plan.get('expected_output', 'N/A')}")
        return plan

    def execute(self, plan, data_context):
        print("\n" + "="*60)
        print("PHASE 2: EXECUTION")
        print("="*60)
        steps_text = "\n".join(f"{i}. {s}" for i, s in enumerate(plan.get('steps', []), 1))
        prompt = f"Task Steps:\n{steps_text}\n\nData available: DataFrame 'df'\n{data_context}\n\nWrite Python code to execute these steps. Store the final result in the 'result' variable."
        code = self.llm.generate(prompt, EXECUTOR_PROMPT, temperature=0.1)
        self.history.append(("EXECUTOR", code))
        # Strip markdown fences and comment-only lines from the generated code.
        if "```python" in code:
            code = code.split("```python")[1].split("```")[0]
        elif "```" in code:
            code = code.split("```")[1].split("```")[0]
        lines = []
        for line in code.split('\n'):
            s = line.strip()
            if s and (not s.startswith('#') or 'import' in s):
                lines.append(line)
        code = "\n".join(lines).strip()
        code_lines = code.split('\n')
        print(f"\n💻 Generated Code:\n" + "-"*60)
        for i, line in enumerate(code_lines[:15], 1):
            print(f"{i:2}. {line}")
        if len(code_lines) > 15:
            print(f"    ... ({len(code_lines) - 15} more lines)")
        print("-"*60)
        return code

We implement the planning and execution phases of the agent. The Planner produces detailed task steps and validation criteria, and the Executor then generates the corresponding pandas code to carry out the task. This shows how the agent autonomously moves from reasoning to producing actionable code.
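
To make the hand-off between the two phases concrete, here is a hypothetical plan dictionary of the shape _extract_json is expected to return (the exact wording depends on what Phi-2 actually generates), followed by a call that asks the Executor to turn it into pandas code. The agent instance from the earlier sketch is assumed.

# Hypothetical plan, shaped like a parsed Planner response (illustrative only).
example_plan = {
    "steps": [
        "Group the DataFrame df by the 'product' column",
        "Sum the 'sales' column within each group",
        "Store the aggregated DataFrame in 'result'",
    ],
    "expected_output": "A table of total sales per product",
    "validation_criteria": ["Result is a non-empty DataFrame", "No execution errors"],
}
# Ask the Executor role to generate code for this plan.
generated_code = agent.execute(example_plan, "Columns: product (str), sales (int), region (str)")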

    def test(self, plan, result, execution_error=None):
        print("\n" + "="*60)
        print("PHASE 3: TESTING & VERIFICATION")
        print("="*60)
        result_desc = f"EXECUTION ERROR: {execution_error}" if execution_error else f"Result type: {type(result).__name__}\n"
        if not execution_error:
            if isinstance(result, pd.DataFrame):
                result_desc += f"Shape: {result.shape}\nColumns: {list(result.columns)}\nSample:\n{result.head(3).to_string()}"
            elif isinstance(result, (int, float, str)):
                result_desc += f"Value: {result}"
            else:
                result_desc += f"Value: {str(result)[:200]}"
        criteria_text = "\n".join(f"- {c}" for c in plan.get('validation_criteria', []))
        prompt = f"Validation Criteria:\n{criteria_text}\n\nExpected: {plan.get('expected_output', 'N/A')}\n\nActual Result:\n{result_desc}\n\nEvaluate whether the result meets the criteria. Return JSON with passed (true/false), issues, and recommendations."
        test_result = self.llm.generate(prompt, TESTER_PROMPT, temperature=0.2)
        self.history.append(("TESTER", test_result))
        test_json = self._extract_json(test_result) or {"passed": execution_error is None,
                                                        "issues": ["Could not parse test result"],
                                                        "recommendations": ["Review manually"]}
        print(f"\n✓ Test Results:\n  Status: {'✅ PASSED' if test_json.get('passed') else '❌ FAILED'}")
        if test_json.get('issues'):
            print("  Issues:")
            for issue in test_json['issues'][:3]:
                print(f"    • {issue}")
        if test_json.get('recommendations'):
            print("  Recommendations:")
            for rec in test_json['recommendations'][:3]:
                print(f"    • {rec}")
        return test_json

    def run(self, task, df=None, data_info=None):
        print("\n🤖 SELF-VERIFYING DATA-OPS AGENT (Local HF Model)")
        print(f"Task: {task}\n")
        if data_info is None and df is not None:
            data_info = f"Shape: {df.shape}\nColumns: {list(df.columns)}\nSample:\n{df.head(2).to_string()}"
        plan = self.plan(task, data_info)
        code = self.execute(plan, data_info)
        result, error = None, None
        try:
            # Execute the generated code in a namespace that exposes pd, np, and df.
            local_vars = {'pd': pd, 'np': np, 'df': df}
            exec(code, local_vars)
            result = local_vars.get('result')
        except Exception as e:
            error = str(e)
            print(f"\n⚠️  Execution Error: {error}")
        test_result = self.test(plan, result, error)
        return {'plan': plan, 'code': code, 'result': result, 'test': test_result, 'history': self.history}

We handle the testing and verification phase of the workflow. The agent evaluates its own output against the predefined validation criteria and summarizes the outcome as structured JSON. We then integrate all three phases, planning, execution, and testing, into a single self-verifying pipeline that runs end-to-end without manual intervention.
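
As a quick usage sketch (assuming an agent instance and a DataFrame df already exist, and with a made-up task string), the dictionary returned by run() can be inspected afterwards to review the verdict, the generated code, and the raw transcript of each role.

# Inspect the artifacts returned by one full planning/execution/testing cycle.
output = agent.run("Count rows per region", df=df)
print(output['test'].get('passed'))    # Tester verdict from the 'passed' field
print(output['code'])                  # pandas code produced by the Executor
for role, text in output['history']:   # raw Planner / Executor / Tester responses
    print(role, len(text))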

def demo_basic(agent):
    print("\n" + "#"*60)
    print("# DEMO 1: Sales Data Aggregation")
    print("#"*60)
    df = pd.DataFrame({'product': ['A','B','A','C','B','A','C'],
                       'sales': [100,150,200,80,130,90,110],
                       'region': ['North','South','North','East','South','West','East']})
    task = "Calculate total sales by product"
    output = agent.run(task, df)
    if output['result'] is not None:
        print(f"\n📊 Final Result:\n{output['result']}")
    return output


def demo_advanced(agent):
    print("\n" + "#"*60)
    print("# DEMO 2: Customer Age Analysis")
    print("#"*60)
    df = pd.DataFrame({'customer_id': range(1, 11),
                       'age': [25,34,45,23,56,38,29,41,52,31],
                       'purchases': [5,12,8,3,15,7,9,11,6,10],
                       'spend': [500,1200,800,300,1500,700,900,1100,600,1000]})
    task = "Calculate average spend by age group: young (under 35) and mature (35+)"
    output = agent.run(task, df)
    if output['result'] is not None:
        print(f"\n📊 Final Result:\n{output['result']}")
    return output


if __name__ == "__main__":
    print("🚀 Initializing Local LLM...")
    print("Using CPU mode for maximum compatibility\n")
    try:
        llm = LocalLLM(use_8bit=False)
        agent = DataOpsAgent(llm)
        demo_basic(agent)
        print("\n\n")
        demo_advanced(agent)
        print("\n" + "="*60)
        print("✅ Tutorial Complete!")
        print("="*60)
        print("\nKey Features:")
        print("  • 100% local - no API calls required")
        print("  • Uses Phi-2 from Microsoft (2.7B params)")
        print("  • Self-verifying 3-phase workflow")
        print("  • Runs on free Google Colab CPU/GPU")
    except Exception as e:
        print(f"\n❌ Error: {e}")
        print("Troubleshooting:\n1. pip install -q transformers accelerate scipy\n2. Restart the runtime\n3. Try a different model")

We build two demo examples that exercise the agent on simple sales and customer datasets. We initialize the model, run the DataOps workflow, and observe the full cycle from planning to validation. We conclude the tutorial by summarizing the key benefits and encouraging further experimentation with local models.
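
To experiment beyond the bundled demos, the same agent can be pointed at your own data. The sketch below is a hypothetical extension, not part of the original tutorial: 'your_data.csv' is a placeholder path and the task string is just an example.

# Hypothetical extension: run the agent on a CSV of your own.
my_df = pd.read_csv("your_data.csv")   # placeholder path, replace with a real file
report = agent.run("Report the number of missing values in each column", df=my_df)
print(report['test'])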

In conclusion, we created a fully autonomous and self-verifying DataOps system powered by a local Hugging Face model. We see how each stage, planning, execution, and testing, interacts seamlessly to produce reliable results without relying on any cloud APIs. This workflow highlights the strength of local LLMs such as Phi-2 for lightweight automation and invites us to extend the architecture to more advanced data pipelines, validation frameworks, and multi-agent data systems in the future.


Asif Razzaq is the CEO of Marktechpost Media Inc. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence media platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts over 2 million monthly views, illustrating its popularity among readers.
