Delete all HTML tags from an Excel sheet while preserving the text in between
Hi, I have an Excel sheet that was exported directly from a database. It includes various strings, some of which are HTML tags. I want to delete all the link tags, for example
<a href="mylink" target="_blank">
but keep all the other strings intact. I was trying to do this with Find and Replace, but I cannot figure out a regex that will do this and also work in Excel.
I have scoured the net but was not able to find a solution.
If I search for
<a href*>
it deletes almost all the content of the spreadsheet, because the match runs to the last closing tag it can find.
Is there a way to redo my search so that it stops at the next closing tag and not the last one?
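To make the behavior described above concrete, here is a minimal sketch in Python rather than in Excel's Find and Replace. The cell text is made up for illustration. It contrasts a pattern that runs to the last ">" with one that is bounded so it stops at the next ">":

```python
import re

# Hypothetical cell content; the link and the text are made up for illustration.
cell = 'See <a href="mylink" target="_blank">the docs</a> and <b>this</b>.'

# Greedy: ".*" runs to the LAST ">" in the string, swallowing nearly everything.
greedy = re.sub(r'<a href.*>', '', cell)

# Bounded: "[^>]*" cannot cross a ">", so the match stops at the NEXT ">".
# The second alternative also removes the closing </a>.
bounded = re.sub(r'<a\b[^>]*>|</a>', '', cell)

print(greedy)
print(bounded)
```

The bounded pattern only handles simple link tags like the one in the question; an attribute value that itself contains ">" would still break it.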
microsoft-excel regex
Parsing HTML with regex is a hard job
– Toto
Jan 28 at 15:39
asked Jan 28 at 14:45
Samuel
1 Answer
The easiest (in fact the only good) way I know of is to use Lynx, the old text-mode web browser:
lynx --dump URL_or_HTML-file
but then you need to find a way to run it on Windows (e.g. Cygwin or WSL).
One more thing: you should realize that what you're asking for is essentially a web browser in its full glory.
Lynx is that, even with its inherent limitations and shortcomings.
Example:
$ lynx --dump www.google.se
Sök [1]Bilder [2]Maps [3]Play [4]YouTube [5]Nyheter [6]Gmail [7]Drive
[8]Mer »
[9]Webbhistorik | [10]Inställningar | [11]Logga in
_______________________________________________________
Sök på Google Jag har tur [12]Avancerad sökning
[13]Språkverktyg
Google på: [14]English
[15]Annonsera med Google [16]Företagslösningar [17]+Google
[18]Allt om Google [19]Google.com
© 2019 - [20]Sekretess - [21]Villkor
References
1. http://www.google.se/imghp?hl=sv&tab=wi
2. http://maps.google.se/maps?hl=sv&tab=wl
3. https://play.google.com/?hl=sv&tab=w8
4. http://www.youtube.com/?gl=SE&tab=w1
5. http://news.google.se/nwshp?hl=sv&tab=wn
6. https://mail.google.com/mail/?tab=wm
7. https://drive.google.com/?tab=wo
8. https://www.google.se/intl/sv/about/products?tab=wh
9. http://www.google.se/history/optout?hl=sv
10. http://www.google.se/preferences?hl=sv
11. https://accounts.google.com/ServiceLogin?hl=sv&passive=true&continue=http://www.google.se/
12. http://www.google.se/advanced_search?hl=sv&authuser=0
13. http://www.google.se/language_tools?hl=sv&authuser=0
14. http://www.google.se/setprefs?sig=0_aqHPilDIYLLsVqHCw93WtOpJW0k=&hl=en&source=homepage&sa=X&ved=0ahUKEwifvrHulZHgAhXGjywKHVziBaIQ2ZgBCAQ
15. http://www.google.se/intl/sv/ads/
16. http://www.google.se/intl/sv/services/
17. https://plus.google.com/103822168634868962761
18. http://www.google.se/intl/sv/about.html
19. http://www.google.se/setprefdomain?prefdom=US&sig=K_KnzY40g4UTDMelz-_ON-EEbQQGM=
20. http://www.google.se/intl/sv/policies/privacy/
21. http://www.google.se/intl/sv/policies/terms/
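If installing Lynx is not practical, a rough alternative is an HTML parser from a scripting language. The following is only a sketch, assuming Python is available and the cell values can be read as plain strings. It is a swapped-in technique, not what Lynx does internally. The names `TextOnly` and `strip_tags` are hypothetical helpers introduced for illustration.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collects text nodes and drops every tag (hypothetical helper)."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain characters.
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        # Called for the text between tags; tags themselves are ignored.
        self.parts.append(data)


def strip_tags(fragment: str) -> str:
    """Return the text of an HTML fragment with all tags removed."""
    parser = TextOnly()
    parser.feed(fragment)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


# Made-up cell value for illustration.
print(strip_tags('Read <a href="mylink" target="_blank">the manual</a> &amp; enjoy'))
```

Unlike Lynx, this does no layout and keeps no list of link targets; it only concatenates the text between the tags.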
answered Jan 28 at 19:04
Hannu