Regex for strings containing substring that contains substring











up vote
2
down vote

favorite












Let's say I have this string:



[section][module_text id="123"]text[/module][module_number id=""]000[/module][/section]


I can find each entire [module...[/module...] substring with



([module)(.*?)([/module.*?])


But I would like to only match the [module] string that has a digit in the id="" parameter. In this case the first [module].



I've been trying a bunch of stuff but i cant seem to get any closer.










share|improve this question






















  • Are you sure you don't want tu use XML with an XML parser ?
    – Blusky
    Nov 20 at 7:21















up vote
2
down vote

favorite












Let's say I have this string:



[section][module_text id="123"]text[/module][module_number id=""]000[/module][/section]


I can find each entire [module...[/module...] substring with



([module)(.*?)([/module.*?])


But I would like to only match the [module] string that has a digit in the id="" parameter. In this case the first [module].



I've been trying a bunch of stuff but i cant seem to get any closer.










share|improve this question






















  • Are you sure you don't want tu use XML with an XML parser ?
    – Blusky
    Nov 20 at 7:21













up vote
2
down vote

favorite









up vote
2
down vote

favorite











Let's say I have this string:



[section][module_text id="123"]text[/module][module_number id=""]000[/module][/section]


I can find each entire [module...[/module...] substring with



([module)(.*?)([/module.*?])


But I would like to only match the [module] string that has a digit in the id="" parameter. In this case the first [module].



I've been trying a bunch of stuff but i cant seem to get any closer.










share|improve this question













Let's say I have this string:



[section][module_text id="123"]text[/module][module_number id=""]000[/module][/section]


I can find each entire [module...[/module...] substring with



([module)(.*?)([/module.*?])


But I would like to only match the [module] string that has a digit in the id="" parameter. In this case the first [module].



I've been trying a bunch of stuff but i cant seem to get any closer.







php regex






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 20 at 7:20









kylmark

98111




98111












  • Are you sure you don't want tu use XML with an XML parser ?
    – Blusky
    Nov 20 at 7:21


















  • Are you sure you don't want tu use XML with an XML parser ?
    – Blusky
    Nov 20 at 7:21
















Are you sure you don't want tu use XML with an XML parser ?
– Blusky
Nov 20 at 7:21




Are you sure you don't want tu use XML with an XML parser ?
– Blusky
Nov 20 at 7:21












1 Answer
1






active

oldest

votes

















up vote
3
down vote



accepted










After matching module, you can match non-] characters until you get to the id part, and after the id's ", you can match [^"]*d to ensure that the id has a digit in it:



([module[^]]*id="[^"]*d)(.*?)([/module[^]]*])


https://regex101.com/r/25JcEP/1



If the modules you want to match have ids which always start with digits, then you can simplify it a bit to



([module[^]]*id="d)(.*?)([/module[^]]*])


Note the use of a negative character class rather than .*? for the final ([/module[^]]*]) - negative character classes are a bit more efficient than lazy repetition when possible.






share|improve this answer























  • @kylmark Same regex, just use preg_match_all
    – CertainPerformance
    Nov 20 at 7:39













Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53388048%2fregex-for-strings-containing-substring-that-contains-substring%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
3
down vote



accepted










After matching module, you can match non-] characters until you get to the id part, and after the id's ", you can match [^"]*d to ensure that the id has a digit in it:



([module[^]]*id="[^"]*d)(.*?)([/module[^]]*])


https://regex101.com/r/25JcEP/1



If the modules you want to match have ids which always start with digits, then you can simplify it a bit to



([module[^]]*id="d)(.*?)([/module[^]]*])


Note the use of a negative character class rather than .*? for the final ([/module[^]]*]) - negative character classes are a bit more efficient than lazy repetition when possible.






share|improve this answer























  • @kylmark Same regex, just use preg_match_all
    – CertainPerformance
    Nov 20 at 7:39

















up vote
3
down vote



accepted










After matching module, you can match non-] characters until you get to the id part, and after the id's ", you can match [^"]*d to ensure that the id has a digit in it:



([module[^]]*id="[^"]*d)(.*?)([/module[^]]*])


https://regex101.com/r/25JcEP/1



If the modules you want to match have ids which always start with digits, then you can simplify it a bit to



([module[^]]*id="d)(.*?)([/module[^]]*])


Note the use of a negative character class rather than .*? for the final ([/module[^]]*]) - negative character classes are a bit more efficient than lazy repetition when possible.






share|improve this answer























  • @kylmark Same regex, just use preg_match_all
    – CertainPerformance
    Nov 20 at 7:39















up vote
3
down vote



accepted







up vote
3
down vote



accepted






After matching module, you can match non-] characters until you get to the id part, and after the id's ", you can match [^"]*d to ensure that the id has a digit in it:



([module[^]]*id="[^"]*d)(.*?)([/module[^]]*])


https://regex101.com/r/25JcEP/1



If the modules you want to match have ids which always start with digits, then you can simplify it a bit to



([module[^]]*id="d)(.*?)([/module[^]]*])


Note the use of a negative character class rather than .*? for the final ([/module[^]]*]) - negative character classes are a bit more efficient than lazy repetition when possible.






share|improve this answer














After matching module, you can match non-] characters until you get to the id part, and after the id's ", you can match [^"]*d to ensure that the id has a digit in it:



([module[^]]*id="[^"]*d)(.*?)([/module[^]]*])


https://regex101.com/r/25JcEP/1



If the modules you want to match have ids which always start with digits, then you can simplify it a bit to



([module[^]]*id="d)(.*?)([/module[^]]*])


Note the use of a negative character class rather than .*? for the final ([/module[^]]*]) - negative character classes are a bit more efficient than lazy repetition when possible.







share|improve this answer














share|improve this answer



share|improve this answer








edited Nov 20 at 7:32

























answered Nov 20 at 7:23









CertainPerformance

70.5k143453




70.5k143453












  • @kylmark Same regex, just use preg_match_all
    – CertainPerformance
    Nov 20 at 7:39




















  • @kylmark Same regex, just use preg_match_all
    – CertainPerformance
    Nov 20 at 7:39


















@kylmark Same regex, just use preg_match_all
– CertainPerformance
Nov 20 at 7:39






@kylmark Same regex, just use preg_match_all
– CertainPerformance
Nov 20 at 7:39




















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.





Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


Please pay close attention to the following guidance:


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53388048%2fregex-for-strings-containing-substring-that-contains-substring%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

404 Error Contact Form 7 ajax form submitting

How to know if a Active Directory user can login interactively

Refactoring coordinates for Minecraft Pi buildings written in Python