Combining Ancestry with RM11 and CLAUDE CODE

I have a tree on Ancestry which is synched to one on RM 11. It is a large tree with 200 people DNA tested on GEDmatch, and around 60-70% also with DNA on Ancestry. Recently I have been using CLAUDE CODE to execute a strategy where the Ancestry tree is the source/note repository while the RM 11 tree is the data analysis platform. The reason is because CLAUDE can provide simple instructions via SQL to enable RM 11 to manage all of the DNA operations (goal: probabilities for DNA inherited identical by descent (IBD). Given that RM 11 database is SQL, I find the DB Browser (SQLite) is ideal for both project design and execution. I am not a data scientist so I am bootstrapping on using CLAUDE. So my question is: Are there others in this group that are doing this or something similar? I recently went up to a Pro subscription but at the lowest level and it is working fine. Looking for others that might have already done things I am trying to do.

2 Likes

I am very interested in the work you describe.

I’m using SQL with RM and am actively working on my DNA matches but I haven’t used AI in this way.

Please contact me via my website

Thanks

Hi Richard. Glad to hear from you. The https link to github didn’t work. But I found my “ancient” link that I used years ago to access github and then found you by doing a name search. I see several “projects” on your page involving RM. HOWEVER: I am not a coder. I barely know enough to be dangerous. I bootstrap things so I can run R programs, etc. I am only getting used to SQL using the DB Browser SQLite. However, I have a 40 yr background in genetics as I was one of the early “gene mappers” developing DNA markers to hunt for genes causing genetic disease(s). So my goal is to take “genetic genealogy” up to a higher standard, in qualifying what people call “DNA matches” as assigning probabilities for DNA inherited identical by descent (IBD). So, considering my limitations in working at your data science level, if you are still interested in sharing info about methods and approaches I’d be happy to do so as my coming from the genetics side might be complementary to you from the data analysis side. My easiest point of contact is via email: paul.watkins@gmail.com

Thanks for going through all that trouble.
I fixed the link in the post to-

https://richardotter.github.io/

Your background is interesting. Thanks for sharing.

I’m no data scientist. I come from a background in biochemistry studying DNA secondary structure and DNA Topoisomerases and then biochemistry oriented software.

My software interests began when I used punch cards to enter in the DNA sequence of px174 phage soon after it was published and have done bioinformatics at a low level ever since.

I got my whole genome sequenced at Sequencing.com but haven’t done much with it yet.

I’m not clear that I’m following what you ssid-

So my goal is to take “genetic genealogy” up to a higher standard, in qualifying what people call “DNA matches” as assigning probabilities for DNA inherited identical by descent (IBD).

Most DNA matching today is done with SNPs. I wonder what more could be done with more sequence data.
MyHeritage is claiming that their new kits are being fully sequenced (but with much lower data redundancy and presumably lower accuracy.)

Anyway, I was wondering if Claude had access to your online DNA match data. That’s how I read your initial post.

Sorry for this rather disjointed reply…

Richard

Hi Richard. We have some similar backgrounds. I had a 30X WGS Nebula Genomics about 5 yr ago. You should check out https://wgsextract.github.io/

My current project is to use Ancestry for primary tree (source, docs, media, etc.) and RM11 via Tree Share as the “operating platform” for DNA Test (Named Group, etc.) tracking, and then DNA sharing.

Next I using GENLIB CRAN: Package GENLIB

I can use all the advanced genetic bioinformatic functions for both individual and pedigree DNA analysis.

RE Advanced DNA analysis check out the FB group called personalwgs. It covers everything latest and greatest in whole genome sequencing.

Here you will find the latest and greatest on all things genetic genealogy including recent advances in using WGS to complement and perhaps replace SNP microarrays.

No I don’t intend to give CLAUDE access to my online DNA match data. I’m using it to help structure the elements of linking different components but I’m still in a learning stage myself.

Paul