No Code Backend Engine for Internal Link Automation [Part 2]
I finally set up something I’ve been wanting to build for a long time, a small backend engine that quietly runs in the background and keeps a record of all new blogs published and create semantic embeddings summary (to help my internal linking automation idea)
It starts with something simple: the RSS feed.
Every time a new blog goes live, the RSS updates. And the moment that happens, the Pabbly flow wakes up.
From there, it does all the heavy lifting.
It pulls the new blog and extracts everything that matters
The flow takes the blog URL and breaks the page down into clean, usable parts:
the title
all H2 and H3 headings
the cleaned body text
(without images, footers, navigation bars, or any HTML junk)
Basically, it strips the post down to its pure meaning — the actual writing.
Then it creates a semantic summary for embeddings
This part is fun.
Using a custom prompt, the flow generates a short embedding summary that captures:
what the blog is about
the topics it covers
the intent behind the content
the structure reflected in its headings
This summary goes into the database and becomes the “embedding memory” of that blog post. Perfect for semantic matching, clustering, and — most importantly — internal linking.
Everything is stored in a clean backend database
Each new blog becomes a complete, structured record:
URL
title
all headings
cleaned content
embedding summary
timestamp
Over time, this creates a growing content intelligence layer behind the blog — something I’ll use for automations, recommendations, and even content revivals.
Why this matters
This system means:
I never have to manually check new blogs
I never have to copy-paste titles, headings, or text
I never have to rebuild internal links after publishing
I now have a database that “understands” my content semantically
It’s a small automation, but the impact is huge.
From the moment a blog goes live, it becomes part of a smarter, self-updating backend.
What comes next
Now that the backend is ready, the next layer is automatic internal linking.
Since every blog has:
a summary
a structure
and a meaning stored semantically
The system can soon recommend:
which older blogs to link
where to link them
what anchor text matches
and how to keep everything updated as content grows
This is the foundation.
And honestly, it feels good to finally have it running.
Also Read:
Part 1 of Internal Linking System [Backend Preparation]

