GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:52:34 Sequence gi568815593r:90418822_90625289 : 206468 bp : 37.54% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 25191 25898 708 2 0 86 49 149 0.826 6.78 1.02 PlyA + 27153 27158 6 1.05 2.03 PlyA - 29449 29444 6 1.05 2.02 Term - 42731 42346 386 0 2 84 35 296 0.778 17.97 2.01 Init - 49788 49770 19 0 1 56 94 11 0.221 -1.26 2.00 Prom - 50842 50803 40 -4.55 3.05 PlyA - 51121 51116 6 1.05 3.04 Term - 55735 54872 864 1 0 -19 48 546 0.241 31.61 3.03 Intr - 56009 55886 124 1 1 69 43 102 0.215 3.47 3.02 Intr - 56222 56135 88 0 1 54 28 106 0.361 -0.69 3.01 Init - 58049 57974 76 2 1 98 78 76 0.580 8.90 3.00 Prom - 60107 60068 40 -9.55 4.00 Prom + 60330 60369 40 -8.35 4.01 Init + 66747 66863 117 2 0 39 95 98 0.932 5.85 4.02 Intr + 69179 69308 130 1 1 51 79 112 0.996 5.95 4.03 Intr + 73236 73349 114 1 0 27 74 102 0.430 2.20 4.04 Intr + 76856 76912 57 0 0 127 93 27 0.656 5.04 4.05 Intr + 83088 83167 80 1 2 37 94 55 0.636 -0.65 4.06 Intr + 87707 87853 147 1 0 31 80 334 0.934 26.41 4.07 Term + 93232 93318 87 0 0 16 48 100 0.201 -4.52 4.08 PlyA + 94980 94985 6 1.05 5.06 PlyA - 95055 95050 6 1.05 5.05 Term - 100663 99998 666 1 0 134 43 442 0.999 37.14 5.04 Intr - 106479 106214 266 1 2 104 127 199 0.585 21.71 5.03 Intr - 110800 110627 174 2 0 42 103 193 0.962 15.19 5.02 Intr - 111210 111080 131 2 2 40 50 95 0.134 0.32 5.01 Init - 116496 116456 41 0 2 54 77 70 0.198 2.21 5.00 Prom - 134991 134952 40 -2.35 6.00 Prom + 139019 139058 40 -2.75 6.01 Init + 139755 139858 104 2 2 64 99 95 0.324 8.04 6.02 Intr + 139975 140096 122 1 2 12 96 55 0.075 -2.08 6.03 Intr + 156762 157099 338 1 2 93 88 132 0.012 8.01 6.04 Intr + 162371 162479 109 1 1 39 81 41 0.000 -2.56 6.05 Intr + 165720 165815 96 1 0 76 58 67 0.000 1.56 6.06 Intr + 196014 196198 185 1 2 106 103 109 0.750 12.79 6.07 Intr + 198983 199132 150 1 0 36 111 117 0.996 8.24 6.08 Intr + 203776 203880 105 0 0 93 107 65 0.993 8.39 6.09 Intr + 206309 206422 114 1 0 116 36 109 0.718 8.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:90418822_90625289|GENSCAN_predicted_peptide_1|235_aa MAILPKVIYRFNAIPIKLPKTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPD FKLYYKATVTKTARYWYQNRDIDQWNRTEPSEITPRTYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSYIKVY >gi568815593r:90418822_90625289|GENSCAN_predicted_CDS_1|708_bp atggccatactgcccaaggtaatctacagattcaatgccatccccatcaagctaccaaag actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcacggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacgccgcgtacctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atgtccaaaacaccaaaagcaatggcaaccaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca acatgggagaaaatttttgcaacctactcatatataaaagtatactga >gi568815593r:90418822_90625289|GENSCAN_predicted_peptide_2|134_aa MIQDMKGDVINLGDRQLTVMHMPGHSRGSICLHDKDRKILFSGDVVYDGSLIDWLPYSRI SDYVGTCERLIELVDRGLVEKVLPGHFNTFGAERLFRLASNYISKAGICHKVSTFAMRSL ASLALRVTNSRTSP >gi568815593r:90418822_90625289|GENSCAN_predicted_CDS_2|405_bp atgatacaagatatgaaaggggatgtgatcaaccttggtgacagacagctcactgttatg cacatgccaggtcactccaggggcagtatttgcttacatgacaaagaccgaaagattctc ttcagtggagacgtcgtgtatgatggatcactgattgactggctcccatacagcaggata agtgactatgttggaacttgtgaacgtctaatagaattagtggacagaggtctggtagag aaggtgcttcctgggcacttcaatacctttggtgctgaaaggctttttcgattggcttct aactatatttcaaaagctgggatatgtcacaaagtttctacttttgccatgcgatctctt gcaagtttagctctacgtgtaacaaattctaggacctcgccctag >gi568815593r:90418822_90625289|GENSCAN_predicted_peptide_3|383_aa MDAIVERVAEEVLSYLVIPEHKPDKIFDCFLGYLLPKSRSVGKTLQPKCRPARKGKAPPP RQRGFFALGRRRSRRTTREAPHFLRQLGVRALQSQWLLRPRAGGAASACCRPGGRVRERT LRAASVCVLPPPIPTPASHAPASLYASSLAAGRPLHTAAVPCVWTACGRLARPHSPRLAA PLPSMSALEWYAHKSLGDGIFWIQERFYESGNRANIWLVRGSEQDVVIDTGLGLRSLPEY LYSSGLLQDREAKEDAARRPLLAVATHVHFDHSGGLYQFDRVAVHHAEAEALARGDNFET VTWLSDSEVVRTPSPGWRARQFRVQAVQPTLILQDGNGPPRARALVKGGYWEETVRVPHV ADLRGHKGDYFGTSQSHFCFTLP >gi568815593r:90418822_90625289|GENSCAN_predicted_CDS_3|1152_bp atggatgctattgtggaaagggtggctgaagaagtcctctcttacttggtgatacctgag cataagcctgacaaaatcttcgattgcttcctcggttacctgctccctaagtctcgaagc gtgggcaaaactctgcagccgaagtgccggccggcaaggaaagggaaggccccaccccct cggcaaagaggtttttttgcccttggcaggcgccgtagtcgccgtacaacccgcgaggcc ccacactttctgcgccagctgggagtgcgcgccctgcagagccagtggctcctgcgcccc cgtgctggtggcgcagcctctgcttgctgccgacctggcggaagggtgcgggagcgcacg ctacgggcagcgtcggtctgcgtccttcctcctcctattccgacccccgcatcccacgcc cccgcctcgctctacgcctcctccctggctgcaggcagaccgcttcacacagccgcagtg ccctgtgtgtggacagcctgtgggagactcgcacgcccacactcaccccgcctggctgca cccctgcccagcatgtcggcgctcgagtggtacgcccacaagtctctaggcgatggtatc ttctggattcaagaacgtttctacgagtcgggcaaccgtgccaacatctggctggtgcgc ggctccgagcaggacgtggtgatcgatacaggcctggggctgcgcagcctcccggagtac ctgtactcctccggcctcttgcaggaccgagaggccaaagaggacgcggcgcgccggcca ctgcttgccgtggccacccacgtgcacttcgaccactccggcggcctctaccagttcgac cgcgtggcagtgcaccacgccgaggccgaggcgctggctcgcggggacaactttgagacc gtgacctggctttccgatagcgaggtggtgcggacgcccagccccggctggagggccaga cagttccgggtacaggcggtgcagcccaccctcatcctgcaggatggtaatgggcccccg cgggcgcgcgctctcgttaagggagggtattgggaagagactgtccgagtgccgcatgtt gcagacttgcgtggccataaaggggattattttggcacctcgcagagccacttctgtttc acccttccctga >gi568815593r:90418822_90625289|GENSCAN_predicted_peptide_4|243_aa MAGNKGRGRAAYTFNIEAVGFSKGEKLPDVVLKPPPLFPDTDYKPVPLKTGEGEEYMLAL KQELRETMKRMPYFIETPEERQEKHVPQFLDESEVKCKEIDSSLDAHDDDEIDCISKISD YIERYSKRYMKVYKEEWIPGPKPKKAKDAGKGTPLTNTEDVLKKMEELEKRGDGEKSDEE NEEKEGSKEKSKEGDDDDDDDAAEQEEYDEEEQEEENDYINSYFEDGDDFGADSDDNMDE ATY >gi568815593r:90418822_90625289|GENSCAN_predicted_CDS_4|732_bp atggctgggaataaaggaagaggacgtgctgcttatacctttaatattgaggctgttgga tttagcaaaggtgaaaagttacctgatgtagtgttgaaaccacccccactatttcctgat acagattataaaccagtgccactgaaaacaggagaaggtgaagaatatatgctggctttg aaacaggagttgagagaaacaatgaaaagaatgccttattttattgaaacacctgaagaa agacaagaaaaacatgttccacaattcttagacgaatcagaagtcaaatgcaaagagata gatagctctctggatgctcatgatgatgatgaaattgattgtataagcaaaatctcagac tatattgaaaggtatagtaaaagatacatgaaggtatacaaggaagaatggataccaggc ccaaaacccaaaaaggcaaaagacgcaggcaaaggcacaccactcactaatactgaagat gtgttgaaaaaaatggaggaattggaaaaaagaggtgatggtgaaaaatcagatgaggaa aatgaagagaaagaaggaagcaaagagaaaagtaaagaaggtgatgatgacgatgacgat gatgccgcagaacaggaggaatatgatgaagaagagcaagaagaggaaaatgactacatt aattcatactttgaagatggagatgattttggcgcagacagtgatgacaacatggatgag gcaacctattag >gi568815593r:90418822_90625289|GENSCAN_predicted_peptide_5|425_aa MADIEKKVNKAHGRKFEMHSKKKMQDSSPTHPEPRLVLRVHPTVSASAVWTLDKDAASLR TWRRRTARKAETLAAEAEADGVSPKQRLGHELTRDSGGGGPFGHLLAECSLLTGTDFNIM AGRHQNRSFPLPGVQSSGQVHAFGNCSDSDILEEDAEVYELRSRGKEKVRRSTSRDRLDD IIVLTKDIQEGDTLNAIALQYCCTVADIKRVNNLISDQDFFALRSIKIPVKKFSSLTETL CPPKGRQTSRHSSVQYSSEQQEILPANDSLAYSDSAGSFLKEVDRDIEQIVKCTDNKREN LNEVVSALTAQQMRFEPDNKNTQRKDPYYGADWGIGWWTAVVIMLIVGIITPVFYLLYYE ILAKVDVSHHSTVDSSHLHSKITPPSQQREMENGIVPTKGIHFSQQDDHKLYSQDSQSPA AQQET >gi568815593r:90418822_90625289|GENSCAN_predicted_CDS_5|1278_bp atggcagatattgagaaaaaggtgaacaaggctcatgggagaaaattcgagatgcacagc aagaaaaaaatgcaagacagctccccgacacatcctgagccacgcctcgtcctcagggtg cacccgacagtctctgccagcgcagtgtggacgctggacaaagacgcagccagtttgcgg acgtggcggcggcgtacggcccggaaggcggagacgttggcggcagaggcggaggcggac ggggtcagcccaaagcagaggctcggccatgaacttacccgggacagcggcggcggcgga ccttttggccatcttctcgcagagtgctccctgctaacggggacagattttaacattatg gcagggaggcatcagaatcgtagttttcctcttccaggagttcagtcaagtggtcaagta catgcatttggaaattgttcagacagtgatattttggaggaggatgctgaagtgtatgaa cttcgatccagaggaaaagagaaagtccgaagaagtacatcaagagatagacttgacgac attatagtattaacaaaagatatacaagaaggagatacattaaatgcaatagcccttcag tactgttgtacggtagcagatatcaagagagttaacaatctcatcagtgatcaagacttt tttgcccttaggtctatcaaaattccagtaaaaaagttcagttccttgaccgaaacactt tgtcctccaaaaggaagacagacttcacgtcattcatctgttcaatactcttccgaacaa caggaaattttgccagctaatgattctcttgcttacagtgactcagctggtagcttttta aaagaagtagaccgagacatagaacaaatagtaaagtgtacagacaataagagagagaac ctcaatgaggtagtatcggccttaacagcacaacaaatgcgttttgaacctgataacaaa aacactcaacgtaaagacccctattatggagcagactggggaatagggtggtggacagct gtagtgataatgttgatagtaggtataataacaccagtgttttatttgttgtattatgaa attttagctaaggtggatgttagtcatcattcaacagtggactcttcacatttacattca aaaatcacacccccatcacagcagagagaaatggaaaatggaattgtgccaactaaagga atacatttcagccaacaagatgatcataaactgtatagtcaagattctcagtcacctgct gctcaacaggaaacatag >gi568815593r:90418822_90625289|GENSCAN_predicted_peptide_6|441_aa MAKAGCSRRQSADLAARCGEQSAQPGAATPAPLVSGSKNQQRGQGVRTGVRGRARVCGGP AGTAGSARMSVFLGPGVSPPMNGQRKLHPRPILSPRWLWGPALPQGFVYRPGNFDHERQT PSNWRLTSGNCSDYSQRAYMGAGHPGEFSGMIQICLLGTEHHSPGSQCCLDSRKPVSAAP GVPLGVAEVRTCSFSSEKFVITDLLKPTSVTLSKSFSVQLCSVAGFLGQPTWGWSILFQD VLLKMAGKLMLAVSSPWMPSASLLVNLLSALLILFVFGETEIRFTGQTEFVVNETSTTVI RLIIERIGEPANVTAIVSLYGEDAGDFFDTYAAAFIPAGETNRTVYIAVCDDDLPEPDET FIFHLTLQLPSIAVSEPKGRNESMPLTLIREKGTYGMVMVTFEVEGGPNPPDEDLSPVKG NITFPPGRATVIYNLTVLDDE >gi568815593r:90418822_90625289|GENSCAN_predicted_CDS_6|1323_bp atggccaaggcagggtgttcccgacgccagagcgcggacctggctgcccgctgcggagag cagagtgcgcagcccggggcggccacccccgctccgctggtcagtggtagtaagaatcag cagcgcgggcaaggagtacggacgggagtcagaggcagagcgagggtgtgtggagggccg gcggggaccgccgggagcgcgcggatgtcggtgttcctggggccaggggtttctcctccc atgaatggacagcgaaagcttcatcctagaccaatcctgtctccacgatggctttgggga cctgccttgccccagggttttgtctacaggcctggcaactttgaccatgagaggcagact ccctctaactggaggctgacaagtggaaactgttcggattattcccagagggcttacatg ggggcgggtcatcctggtgaattctcaggcatgatccaaatctgcctcctaggcacagaa caccattctccagggtcgcagtgctgcctggactccaggaagccagtctctgctgctcct ggtgtgcccctaggggtggccgaggttagaacatgctcatttagctcggagaagtttgtt attaccgaccttctgaagcctacttctgtcaccttgtcaaagtcattctccgtccagctt tgttccgttgctgggttcttggggcagcccacctggggctggagtatccttttccaagat gtcttactcaagatggctggcaaattgatgctggctgttagttctccatggatgccctct gcatctttattagtaaatcttctttcagctttactcatcctatttgtgtttggagaaaca gaaataagatttactggacaaactgaatttgttgttaatgaaacaagtacaacagttatt cgtcttatcattgaaaggataggagagccagcaaatgttactgcaattgtatcgctgtat ggagaggacgctggtgacttttttgacacatatgctgcagcttttatacctgccggagaa acaaacagaacagtgtacatagcagtatgtgatgatgacttaccagagcctgacgaaact tttatttttcacttaacattacagcttccctcaatcgcagtgagtgagcccaagggcaga aatgagtctatgcctcttactctcatcagggaaaagggaacctatggaatggtcatggtg acttttgaggtagagggtggcccaaatccccctgatgaagatttgagtccagttaaagga aatatcacctttccccctggcagagcaacagtaatttataacttgacagtactcgatgac gag