GENSCAN 1.0    Date run:  8-Nov-116    Time: 00:24:59

Sequence gi568815589r:96135463_96402104 : 266642 bp : 44.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5448   5628  181  2  1   81   52    84 0.566   3.54
 1.02 Intr +  10846  11516  671  2  2   74   35   660 0.712  50.72
 1.03 Term +  11603  12394  792  1  0   23   41   475 0.751  29.60
 1.04 PlyA +  12481  12486    6                               1.05

 2.00 Prom +  12925  12964   40                              -1.66
 2.01 Init +  17419  17465   47  0  2   71   97    39 0.816   3.26
 2.02 Intr +  31704  31854  151  0  1   97   37    90 0.133   4.96
 2.03 Intr +  32440  32541  102  0  0   19   66    97 0.153   0.97
 2.04 Intr +  32759  32828   70  1  1   33   52    71 0.389  -3.25
 2.05 Intr +  33467  33661  195  0  0   91   94    65 0.577   6.79
 2.06 Term +  33901  33962   62  2  2   65   43    62 0.577  -2.63
 2.07 PlyA +  34423  34428    6                               1.05

 3.21 PlyA -  34846  34841    6                               1.05
 3.20 Term -  36813  36525  289  2  1   35   48   155 0.392   1.05
 3.19 Intr -  37125  37047   79  1  1   77   92    49 0.391   2.81
 3.18 Intr -  38423  38339   85  2  1   92   45    85 0.278   3.99
 3.17 Intr -  44398  44352   47  2  2  106   77    10 0.011  -0.17
 3.16 Intr -  45620  45510  111  0  0   61   34   111 0.010   3.35
 3.15 Intr -  48435  48320  116  2  2   86   18    73 0.004   0.19
 3.14 Intr -  70356  70285   72  2  0   87   41    74 0.085   1.12
 3.13 Intr -  74245  74187   59  1  2   77  105    37 0.177   1.98
 3.12 Intr -  79024  78970   55  0  1  107   97    49 0.548   6.68
 3.11 Intr -  97475  97279  197  2  2  102   58    57 0.017   2.31
 3.10 Intr - 100108 100007  102  1  0   98   77    95 0.928   9.77
 3.09 Intr - 108932 108867   66  2  0  100   94    86 0.923   9.50
 3.08 Intr - 111128 111094   35  0  2   93  116     0 0.235   1.24
 3.07 Intr - 117448 117341  108  2  0   75   95    85 0.479   8.16
 3.06 Intr - 119446 119406   41  0  2   21  109    66 0.005  -0.13
 3.05 Intr - 143971 143909   63  0  0  100   95    61 0.015   5.83
 3.04 Intr - 151590 151514   77  0  2   78    1   123 0.018   0.91
 3.03 Intr - 154105 154016   90  1  0   84   59    40 0.409   0.89
 3.02 Intr - 163000 162954   47  2  2   79   99    51 0.934   3.43
 3.01 Init - 166642 166489  154  1  1  110  110   146 0.994  17.18
 3.00 Prom - 168753 168714   40                              -5.66

 4.13 PlyA - 170380 170375    6                               1.05
 4.12 Term - 179437 179318  120  1  0  120   38    45 0.534   1.17
 4.11 Intr - 184715 184623   93  2  0  120   13    94 0.670   5.26
 4.10 Intr - 186618 186536   83  1  2  130   97   -49 0.204  -0.44
 4.09 Intr - 188707 188629   79  1  1   84  113    28 0.164   4.02
 4.08 Intr - 207947 207810  138  2  0   74   85    13 0.340   0.26
 4.07 Intr - 208534 208442   93  1  0   89   99    52 0.651   6.56
 4.06 Intr - 209939 209837  103  1  1  109   76    20 0.417   3.08
 4.05 Intr - 215709 215641   69  2  0   82   86    89 0.390   6.40
 4.04 Intr - 218531 218485   47  2  2   77   39    45 0.125  -4.29
 4.03 Intr - 229088 229002   87  2  0  107  115    17 0.326   6.47
 4.02 Intr - 232843 232810   34  0  1   91  109     7 0.201   1.33
 4.01 Init - 248172 248015  158  0  2   70   76   418 0.649  36.18
 4.00 Prom - 250414 250375   40                              -4.46

 5.05 PlyA - 250503 250498    6                               1.05
 5.04 Term - 252997 252775  223  0  1   78   33   299 0.999  19.79
 5.03 Intr - 257074 256936  139  2  1   37   55   165 0.927   7.62
 5.02 Intr - 259480 259361  120  2  0   72   64    89 0.890   5.37
 5.01 Init - 264186 264138   49  0  1   86   58    55 0.391   1.41

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  48382  48452   71  2  2  125   94    40 0.832   7.20
S.002 Term +  49913  49993   81  1  0  113   54    55 0.802   2.29
S.003 Term - 151590 151510   81  0  0   78   48   122 0.908   4.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:96135463_96402104|GENSCAN_predicted_peptide_1|547_aa
CCRSGVLKAGGLARNREAWAAARGGTSSWSTLDQSRPGQSLARSPSEVELDPLHAQAEEQ
GNLPYDVTEESIKEFFRGLNISAVRLPCEPSNPVRLKGFGYAEFEDLDSLLSALSLKEES
LGNRRIRVDVADQAQDKDRDDCSFGRDRNRDSDKTGTDWRARPATDSFDDYPPRRGDDSF
GDKYRDRYDSDRYRDGYRDGYRDGQRRDMDPYGGRDRYDDRGSRDYDRGYDSRIGSGRRA
FGSGYRRDDVSEEAGTTMKTDMTDGMIGLGAPEMITLWMIIGVMSTRAASIFGGAKPVDT
AAREREVEERLQKEQEKLQHQLDEPKLERRPPERHPSWRSEETQERERSRTGSESSQTGT
STTSGRSKSAQDARRRENEKSLENETLNKEEDCHSPTSKPPKPDQPLKVMPAPPPKENAW
VKRSSNPPARSQSSDTEQQSPTSGEGKVAPAQPSEEGPARKDENKVDGMNVPKGQTGNSS
RGPGDGGNKDHWKESDRKDGKKDQDSRSAPEPKKPEENPASKFSSASKYAALSVDGEDEN
EGEDYAE

>gi568815589r:96135463_96402104|GENSCAN_predicted_CDS_1|1644_bp
tgctgtcgaagtggcgtcctgaaagccgggggccttgcccggaacagggaggcttgggct
gcggccaggggaggcaccagcagctggagcacactagaccagagccggccaggacagagc
ctggcccgatctccctcggaggtagaactggacccccttcatgctcaggctgaggagcag
ggaaacctaccctatgatgttacagaagagtcaattaaggaattctttcgaggattaaat
atcagtgcagtgcgtttaccatgtgaacccagcaatccagtgaggttgaaaggttttggt
tatgctgaatttgaggacctggattccctgcttagtgccctgagtctcaaggaagagtct
ctaggtaacaggagaattcgagtggacgttgctgatcaagcacaggataaagacagggat
gattgttcttttggccgtgatagaaatcgggattctgacaaaacaggtacagactggagg
gctcgtcctgctacagacagctttgatgactacccacctagaagaggtgatgatagcttt
ggagacaagtatcgagatcgttatgattcagaccggtatcgggatgggtatcgggatggg
tatcgggatggccaacgccgggatatggatccatatggtggccgggatcgctatgatgac
cgaggcagcagagactatgatagaggctatgattcccggataggcagtggcagaagagca
tttggcagtgggtatcgcagggatgatgtctcagaggaggcggggaccactatgaagacc
gatatgacagacgggatgatcggtcttggagctccagagatgattactctctggatgatt
ataggcgtgatgtccactcgagctgcttctatctttggaggggcaaagcccgttgacaca
gctgctagagaaagagaagtagaagaacggctacagaaggaacaagagaagttgcagcat
cagctggatgagccaaaactagaacgacggcctccggagagacacccaagctggcgaagt
gaagaaactcaggaacgggaacggtcgaggacaggaagtgagtcatcacagactgggacc
tccaccacatctggcagaagtaagtcagcccaggatgcacgaaggagagagaatgagaag
tctctagaaaatgaaacactcaataaggaggaagattgccactctccaacttctaaacct
cccaaacctgatcagcccctaaaggtaatgccagcccctccaccaaaggagaatgcttgg
gtgaagcgaagttctaaccctcctgctcgatctcagagctcagacacagagcagcaatcc
cctacaagtggtgagggaaaagtagctccagctcaaccatctgaggaaggaccagcaagg
aaagatgaaaacaaagtagatgggatgaatgtcccaaaaggccaaactgggaactctagc
cgtggtccaggagatggagggaacaaagaccactggaaggagtcagataggaaagatggc
aaaaaggatcaagactccagatctgcacctgagccaaagaaacctgaggaaaatccagct
tccaagttcagttctgcaagcaagtatgctgctctctctgttgatggtgaagatgaaaat
gagggagaagattatgccgaatag

>gi568815589r:96135463_96402104|GENSCAN_predicted_peptide_2|208_aa
MNKHRDFQVALIERDSEKCRALLSFEPNARAKAGLLLEVRLPQLRSSHARRKRQQASSWT
FFQVPQRLYIQLEESEVNMIITKANIPPMTSQERKRCVEMTQHEKDSIERAPVQAASGIH
WEPGTLQYTEEIKASFQKPAVQLRTIQACACTAPPKDLPMQMAGPLSTASVSAGLGQGLR
ISISNKAPGKALVTVSSGTAQLPGMTKR

>gi568815589r:96135463_96402104|GENSCAN_predicted_CDS_2|627_bp
atgaataaacacagagactttcaagttgccttgattgaacgggacagcgagaaatgcaga
gctctcctgtcatttgaacctaatgctcgggccaaggcaggcctgctcctcgaggtccgc
ctacctcagcttagatccagccatgcgcgcaggaagcggcagcaggccagctcatggacg
ttcttccaggtccctcaaaggctgtacatccagctagaagaatccgaagtcaacatgata
atcacaaaggcaaacattccaccaatgacaagtcaagagcgcaagagatgtgttgagatg
acccagcacgagaaggacagcattgaaagagcccctgtccaggcggcttctggcattcac
tgggaaccagggaccctacagtacacagaagagataaaagcaagcttccagaaacctgca
gtccagcttcgcactatacaagcctgcgcgtgcacggcgcctcccaaggacttgcccatg
cagatggctgggcccctttccacagcatctgtttccgcaggtctgggtcagggcctgaga
atcagcatttctaacaaggctccagggaaggccctggtcacagtctcctcagggacagct
cagctgcctggaatgaccaagcgctga

>gi568815589r:96135463_96402104|GENSCAN_predicted_peptide_3|630_aa
MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGI
GKAYSFEEADEVNWETKVLNSLAILLMDSKTQKWRLEERGPDPDPKRVFSDLIKERIQGK
SIKCLDKLFQPITNQEIFESTYDLRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEH
IKEKLAGLEIGILDDTANSETYGIKAFVCAFSKALQEEYKAKEVIIQAGFLSLIPAWAFY
SGAFQRLLLTHYVAYLKLNTKPSICATSQEAEGRRVPFGWHWEQENEVIMGCCSKKYWQL
LLGRLPGVSSLSCSCGWEPEHPTSKTLLMHSASHVGIGDEADYVAEVNAGCAPVECKGIL
RTSKREIRSNTKMKVLEMQPNNKKKPMAGVIIRLQRATSWRCSLNVTCEEEQWSRGMQRK
KRYRGRRDCGMYSQEITDSQDAWQQVGGQNFWRWNLLDSVSPEKVPSEAPCYPSTMGIVM
SMRTVCSSKRDLPAAMAVSEAKGKGKPKEYSGQANFPDSRNQCKVGHKDPQGLVINAAMP
SKAHGPLYPLPVEPEPKTLPFHLTNVFIEDLEYSKSYSRPVDFKCGLQTSSFGITQKPVS
NANSQAPSLAKPEALGVSPASVSNKPTRRL

>gi568815589r:96135463_96402104|GENSCAN_predicted_CDS_3|1893_bp
atgggggacgtcctggaacagttcttcatcctcacagggctgctggtgtgcctggcctgc
ctggcgaagtgcgtgagattctccagatgtgttttactgaactactggaaagttttgcca
aagtctttcttgcggtcaatgggacagtgggcagtgatcactggagcaggcgatggaatt
gggaaagcgtactcgttcgaggaggctgatgaagtgaattgggagacaaaggtgcttaat
tctttagccatcctgttgatggattccaagactcaaaaatggaggctagaggaaaggggt
cctgatccagaccccaagagagtgttctcggacctcatcaaagaaagaattcagggcaag
tccataaagtgtttagataaactttttcaaccaattaccaatcaggaaatctttgaatcc
acctatgacctccggacgctggaaaaactagaggccattgccacagagatcgagcggact
acagggaggagtgtgaagattatacaagcagattttacaaaagatgacatctacgagcat
attaaagaaaaacttgcaggcttagaaattggaattttagatgacacagctaattctgaa
acatatggaatcaaggcgtttgtgtgcgcattttccaaggccctgcaagaggaatataaa
gcaaaagaagtcatcatccaggcgggctttctgagcctgatcccggcctgggccttctac
agcggtgccttccaaaggctgctcctgacacactatgtggcatacctgaagctcaacacc
aagccttccatctgtgctacctcccaggaagccgaaggccgcagagtccctttcggatgg
cactgggagcaggaaaatgaggtgattatgggctgctgctccaagaagtattggcagctg
ttgctggggcggctccctggggtgtcatccctttcttgctcttgtggatgggaaccagag
caccccacttcaaagactctgctcatgcactctgccagtcatgtgggcattggcgatgag
gcagactatgttgccgaagttaatgcaggatgtgctccagtggaatgtaaaggtatccta
cgaacaagcaagagagagattagatcaaacaccaagatgaaagtgttggagatgcagccc
aataacaaaaagaaacccatggctggggtaattataaggcttcagcgagctacgtcctgg
aggtgctccctgaatgttacctgtgaggaggagcagtggagcaggggaatgcagaggaag
aagaggtatcgtggaagaagggattgtgggatgtacagccaggagatcacagactctcag
gatgcctggcagcaagtggggggccagaatttctggcggtggaacttactggacagcgtc
agccctgagaaggttccctctgaagctccctgttacccaagcacaatgggaattgtaatg
agcatgcggacagtctgctcctcaaagagggacttgcctgcagccatggctgtttcagag
gccaaaggaaaagggaaaccgaaagaatacagcggccaagcaaactttcctgacagcaga
aatcaatgcaaggtgggccataaagacccacaaggccttgttataaatgctgcaatgccg
tccaaagctcacggccctctctatcctttaccagtagaaccagaacccaagacactgcct
tttcatttaaccaacgtatttattgaggacctggaatactccaagtcttattctaggcca
gtggatttcaaatgtggtctccagaccagcagcttcggcatcacccagaaacctgtgagc
aatgcaaactctcaggcccccagccttgctaagccagaagctctgggggtcagccctgca
tctgtgtctaacaagcccaccaggcgactctga

>gi568815589r:96135463_96402104|GENSCAN_predicted_peptide_4|367_aa
MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL
GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLLSGCSVVDDVKPVLGKQYSLNIILSV
FAIILGAFIAAGSDLAFNLEGYIFVFLNDIFTAANGVYTKQKMDPKELGKYGVLFYNACF
MIIPTLIISVSTGDLQQLFVPLEYFKTYPSHHAILPLNIPGSTFEFWEGVVLSAFALKPY
LLLVSADVLHGSVQLLQFSPDDSSGWSHQECIRCLHWDINRWRLHFLFVKLCRVKYLVCS
GAFTTGAFPHMLTYRQPLVYGKEAQMDRLDLVEVLPQVGIAKKGSFSSRFLGNWIQLGED
HLNSHPI

>gi568815589r:96135463_96402104|GENSCAN_predicted_CDS_4|1104_bp
atgacggccggcggccaggccgaggccgagggcgctggcggggagcccggcgcggcgcgg
ctgccctcgcgggtggcccggctgctgtcggcgctcttctacgggacctgctccttcctc
atcgtgcttgtcaacaaggcgctgctgaccacctacggtttcccgtcaccaattttcctt
ggaattggacagatggcagccaccataatgatactatatgtgtccaagctaaacaaaatc
attcacttccctgattttgataagaaaattcctgtaaagctgctctctggatgcagtgtg
gttgatgatgtgaagcctgtgctggggaagcagtattcactcaacatcatcctcagtgtc
tttgccattattctcggggctttcatagcagctgggtctgaccttgcttttaacttagaa
ggctatatttttgtattcctgaatgatatcttcacagcagcaaatggagtttataccaaa
cagaaaatggacccaaaggagctagggaaatacggagtacttttctacaatgcctgcttc
atgattatcccaactcttattattagtgtctccactggagacctgcaacagctttttgtt
cctctggagtattttaaaacctatcccagccatcacgccattttacctctaaatattcca
ggatccacatttgaattttgggaaggtgttgttctttcagcatttgccctgaaaccgtat
ctgctgctggtttctgctgatgtactccacggttctgtgcagctattacaattcagccct
gacgacagcagtggttggagccatcaagaatgtatccgttgcctacattgggatattaat
cggtggagactacattttctctttgttaaactttgtagggttaaatatttggtttgttca
ggagcttttaccacaggtgcatttcctcacatgctcacctaccgccaacccctggtgtat
gggaaggaggcccaaatggataggctcgacttggtggaagttctgcctcaggtgggcata
gccaaaaaggggtcattctcttctcggttcctggggaattggatccaactgggagaagac
cacctgaattctcatcccatctaa

>gi568815589r:96135463_96402104|GENSCAN_predicted_peptide_5|176_aa
MGFRHVGQAGLELLTSGERPYLCDYPDCGKAFVQSGQLKTHQRLHTGEKPFVCSENGCLS
RFTHANRHCPKHPYARLKREEPTDTLSKHQAADNKAAAEWLARYWEMREQRTPTLKGKLV
QKADQEQQDPLEYLQSDEEDDEKRGAQRRLQEQRERLHGALALIELANLTGAPLRQ

>gi568815589r:96135463_96402104|GENSCAN_predicted_CDS_5|531_bp
atggggtttcgccatgttggccaggctggtcttgaactcctgacctcaggtgagaggccc
tatctgtgtgactatccagactgtggaaaagcctttgttcaaagtggacagctcaaaaca
catcagcgtcttcacaccggagagaaaccttttgtttgttcagaaaatggctgcctgagc
agattcacccatgcaaaccgccactgtccgaagcacccctacgccaggctgaagagagag
gagcccacggacacactcagcaaacatcaggctgccgacaacaaggccgcggccgagtgg
ctggcgaggtattgggaaatgagagagcagcgcacccccactttgaaaggcaagctggtt
cagaaggctgatcaggagcagcaggaccctctggaataccttcagtctgatgaagaggac
gacgagaagagaggggcccagcgccggctgcaggagcagcgggagcgcctgcatggagcc
ctcgcgctcatagagcttgccaacctgactggggcgccactccgacagtag