GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:15:56 Sequence gi568815589f:89211293_89416422 : 205130 bp : 44.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1115 1143 29 1 2 83 68 94 0.301 4.08 1.02 Intr + 10410 10564 155 0 2 127 -2 66 0.082 1.22 1.03 Term + 15546 15682 137 1 2 63 52 103 0.711 2.38 1.04 PlyA + 15940 15945 6 1.05 2.00 Prom + 25519 25558 40 -4.46 2.01 Init + 30242 30300 59 1 2 78 100 9 0.617 1.91 2.02 Intr + 34777 34837 61 1 1 94 64 68 0.840 3.74 2.03 Intr + 35201 35417 217 1 1 126 77 74 0.964 8.18 2.04 Intr + 44009 44082 74 0 2 122 86 17 0.840 4.03 2.05 Intr + 49772 49904 133 1 1 85 68 42 0.420 2.12 2.06 Intr + 59496 59577 82 1 1 88 78 -2 0.002 -2.50 2.07 Intr + 71378 71676 299 2 2 59 76 149 0.029 7.32 2.08 Intr + 97583 97669 87 0 0 78 106 45 0.095 5.24 2.09 Intr + 99387 99580 194 1 2 78 28 130 0.180 5.21 2.10 Intr + 99989 100059 71 1 2 88 68 193 0.111 15.28 2.11 Intr + 103878 104005 128 0 2 85 115 -4 0.072 2.32 2.12 Intr + 107116 107320 205 2 1 79 94 118 0.069 9.76 2.13 Intr + 114303 114384 82 0 1 17 113 69 0.051 1.94 2.14 Intr + 114605 114746 142 1 1 78 98 28 0.868 2.73 2.15 Intr + 117368 117594 227 0 2 66 100 80 0.802 4.70 2.16 Intr + 121616 121694 79 1 1 111 83 16 0.540 2.52 2.17 Intr + 123230 123438 209 0 2 20 102 91 0.364 2.50 2.18 Intr + 130055 130187 133 1 1 54 110 67 0.718 5.72 2.19 Intr + 135590 135756 167 0 2 59 95 101 0.975 7.58 2.20 Intr + 136787 136922 136 1 1 36 81 120 0.744 6.24 2.21 Intr + 138484 138637 154 2 1 64 64 90 0.738 3.33 2.22 Intr + 139340 139560 221 2 2 71 98 129 0.957 10.15 2.23 Intr + 146119 146273 155 2 2 78 109 136 0.998 14.49 2.24 Intr + 146707 146899 193 0 1 60 79 137 0.906 9.07 2.25 Term + 147429 147532 104 1 2 98 54 27 0.706 -1.26 2.26 PlyA + 148331 148336 6 1.05 3.21 PlyA - 149521 149516 6 1.05 3.20 Term - 150459 150304 156 0 0 62 47 59 0.278 -2.87 3.19 Intr - 152235 152138 98 1 2 80 106 42 0.912 4.93 3.18 Intr - 152658 152449 210 1 0 61 119 178 0.848 17.08 3.17 Intr - 161669 161502 168 0 0 73 94 32 0.159 2.22 3.16 Intr - 165657 165541 117 1 0 41 109 121 0.181 9.94 3.15 Intr - 168337 167453 885 2 0 74 39 964 0.048 81.40 3.14 Intr - 169806 169763 44 2 2 97 100 37 0.998 3.68 3.13 Intr - 170054 169882 173 2 2 86 80 256 0.990 23.34 3.12 Intr - 171488 171291 198 2 0 68 40 110 0.744 3.75 3.11 Intr - 173696 173592 105 2 0 91 75 49 0.939 4.31 3.10 Intr - 175190 175075 116 0 2 72 98 206 0.794 20.17 3.09 Intr - 176316 176094 223 0 1 67 94 370 0.722 33.20 3.08 Intr - 177500 177344 157 1 1 102 89 244 0.932 25.91 3.07 Intr - 177755 177580 176 2 2 72 87 243 0.998 21.34 3.06 Intr - 180123 179972 152 1 2 100 113 278 0.999 31.38 3.05 Intr - 181244 181101 144 0 0 97 92 13 0.561 2.85 3.04 Intr - 182363 182270 94 2 1 87 97 84 0.998 8.74 3.03 Intr - 185543 185445 99 2 0 109 95 113 0.999 14.41 3.02 Intr - 191724 191579 146 1 2 117 94 252 0.993 28.70 3.01 Init - 194158 194059 100 1 1 40 105 94 0.360 4.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 77474 77528 55 1 1 71 100 77 0.854 8.65 S.002 Term - 168337 167412 926 2 2 74 54 1044 0.946 91.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:89211293_89416422|GENSCAN_predicted_peptide_1|106_aa MLMVRLLAVLENIIRYGFSRLLKTLKAIEHHFRFFTLMLDTREQLTSGEQTIAAAGRQLE PREKAFLLLFIECWETVEHQNQAMPPTQRNFLVTLGIPPTWTSGLP >gi568815589f:89211293_89416422|GENSCAN_predicted_CDS_1|321_bp atgctgatggtccggctgcttgctgtgctggaaaacattatccgttatggtttctctagg ctactgaagactctgaaagccatagagcatcattttaggttctttacattgatgttagat actagagagcagctgacttcaggagaacagacaatagcagcagccggcaggcaactagag cccagagaaaaagcctttctactgcttttcatcgagtgttgggaaacagtggagcaccag aaccaggccatgcctcccacccagaggaatttcctggtgactctgggcatcccacctacc tggacctctggcctaccttga >gi568815589f:89211293_89416422|GENSCAN_predicted_peptide_2|1203_aa MRMRPTELHSQEVKVFLVARAWNAHKKANEAGMSRKLRLGVWAMTGVLTGHEHQSTQYPA LIMPFAKKPYPTTKPTLSLLATVSPDTPHTYLFVPITLVCDSLLPGCMDLHKGEPTTGQP QRQRGTMRAKPAAFQVVPFHQGHQTERLKTSYALRISLTICTNTFSVPDNLKEEPTMHSY PMLLSGFGNQDNAGFIKGVWNVSSSSIFLAPDLQPAMPEPPTPSMGSCGARVSPISAAPC STAPSPIDHPRAEESGRRAQDWQAASPAAPVRDPVGEASWVESGGDVANLLSSSGIVNTP IGTLYLAQGKGSSPDPKGEFLDLVQERIQGESTVQNEKPFCLLLLFKVDLGSWLEPEGIQ GITRSAITVQGRQLPLKALWPTLPASTDYVITKPRDTSLHLQRASRMAHKQIYYSDKYFD EHYEYRHVMLPRELSKQVPKTHLMSEEEWRRLGVQQSLGWVHYMIHEPGPVPPPRTSGRK KGRALSLCCDALPGVACGGGNALSVRQADGPLLASVTRPPPRLAAWRRRGRGSPKASCYR GFQTVKHRNENTCPLPQEMKALFKKKTYDEKKTYDQQKFDSERADGTISSEIKSARGSHH LSIYAENSLKSDGYHKRTDRKSRIIAKNVSTSKPEFEFTTLDFPELQGAENNMSEIQKQP KWGPVHSVSTDISLLREVVKPAAVLSKGEIVVKNNPNESVTANAATNSPSCTRELSWTPM GYVVRQTLSTELSAAPKNVTSMINLKTIASSADPKNVSIPSSEALSSDPSYNKEKHIIHP TQKDNFKNNVKKSQLPVQLDLGGMLTALEKKQHSQHAKQSSKPVVVSVGAVPVLSKECAS GERGRRMSQMKTPHNPLDSSAPLMKKGKQREIPKAKKPTSLKKIILKERQERKQRLQENA VSPAFTSDDTQDGESGGDDQFPEQAELSGPEGMDELISTPSVEDKSEEPPGTELQRDTEA SHLAPNHTTFPKIHSRRFRDYCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKR RLVLGLREVLKHLKLKKLKCVIISPNCEKIQSKGGLDDTLHTIIDYACEQNIPFVFALNR KALGRSLNKAVPVSVVGIFSYDGAQDQFHKMVELTVAARQAYKTMLENVQQELVGEPRPQ APPSLPTQGPSCPAEDGPPALKEKEEPHYIEIWKKHLEAYSGCTLELEESLEASTSQMMN LNL >gi568815589f:89211293_89416422|GENSCAN_predicted_CDS_2|3612_bp atgaggatgagacctactgagctgcattcccaggaggttaaggtgttcttagtcgcaaga gcgtggaacgctcataagaaggcaaatgaggctggcatgtccaggaagctacgactggga gtgtgggccatgactggtgttcttacaggacatgagcatcaatcaactcaatacccagcc ctaatcatgccttttgctaaaaagccttatcctaccaccaagccaacattgtcgcttcta gccacagtctcccctgacactccacacacatatctctttgtgcccatcacactggtctgt gatagtctgctccctggttgcatggacttgcacaaaggtgagcccaccacagggcagcca cagcgacagagagggaccatgcgggcaaagccagctgcattccaagttgtacccttccat caaggtcatcagactgaaagacttaaaacttcctacgcgctcagaatctctcttacaatc tgtactaacacattttctgtcccggataatctaaaggaagaaccaaccatgcactcctat cctatgctcttgtctggctttggtaaccaggataatgctggcttcataaaaggggtttgg aatgtttcctcctcttcaatttttttggctccggacctgcagcccgccatgcctgagcct cccaccccctccatgggctcctgtggggcccgagtctccccgataagtgccgccccctgc tccacggcacccagtcccatcgaccacccaagggctgaggagagcgggcgcagggcgcag gactggcaggcagcttcacctgcagccccagtgcgcgatccagtgggtgaagccagctgg gttgagtctggtggggacgtggcgaaccttttgtctagctcagggattgtaaatacacca atcggcactctgtacctagctcaaggaaaggggtccagtccagatcccaagggagaattc ttggatctcgtgcaagaaagaattcagggtgagtccactgtgcaaaatgaaaaacctttc tgccttttactacttttcaaagtggacctaggctcctggctcgagcctgaggggatacaa gggatcacgaggagcgccatcaccgtgcaaggtcgacagcttccgctcaaagccctgtgg cccacgcttccggcctcgaccgactacgtcatcaccaaaccccgtgatacctcactccac cttcagcgcgccagcaggatggcccacaagcagatctactactcggacaagtacttcgac gaacactacgagtaccggcatgttatgttacccagagaactttccaaacaagtacctaaa actcatctgatgtctgaagaggagtggaggagacttggtgtccaacagagtctaggctgg gttcattacatgattcatgagccaggtccagtgccgccgccgcgtacttccggccggaag aaagggcgggctctgtcgctttgctgtgacgcacttcctggcgtcgcctgcgggggcgga aacgctttgtctgtccggcaagccgacggcccgctgctggcctccgtgacgcggcctcct ccgcgcctcgcggcatggcgtcggaggggccgcgggagcccgaaagcgagttgttaccga ggttttcaaacagtgaagcatcgaaatgagaacacatgccctctcccacaagaaatgaaa gctctgtttaagaagaaaacctatgatgagaaaaaaacgtatgatcagcaaaagtttgac agtgaaagggctgatggaactatatcatctgagataaaatcagctagaggttcacatcat ttgtccatttacgctgagaatagtttgaaatcagatggttaccataagcgaacagacagg aaatccagaatcattgcaaaaaatgtatctacctccaaacctgagtttgaatttaccaca ctggactttcctgaactgcaaggtgcagagaacaatatgtcagagatacagaagcaaccc aagtggggacctgtccactctgtctctaccgacatttctcttctaagagaagtagtaaaa ccagctgcagtgttatcaaagggtgaaatagtggtgaaaaataacccaaatgaatctgta actgctaatgccgctaccaattctccttcatgtacaagagagttatcttggacaccaatg ggttatgttgttcgacagacattatctacagaactgtcagcagcccctaaaaatgttact tctatgataaacttaaagaccattgcttcatcagcagatcctaaaaatgttagtatacca tcttctgaagctttatcttcggatccttcctacaacaaagaaaaacacattattcatcct acccaaaaggataattttaaaaataatgtaaagaagagccagcttccagtgcagttggac ttggggggcatgctgacagccctggagaagaagcagcactctcagcatgcaaagcagtcc tccaaaccagtggtagtctcagttggagcagtgccagtcctttccaaagaatgtgcatca ggggagagaggccgccgcatgagtcaaatgaagaccccgcacaatcccttggactccagc gccccactgatgaagaaagggaagcagagggagatccccaaggccaagaagccaacctca ctgaagaagattattttgaaagaacggcaagagagaaagcagcgtctccaagaaaatgct gtgagtccagcttttaccagtgatgacacacaagatggagagagtggtggtgatgaccag tttcccgagcaggcagagctgtcagggccagaggggatggacgaactgatctccactcct tcggttgaggacaagtctgaagagccaccaggcacagagctccagagggacacagaggcc tcccaccttgctcccaatcacaccaccttccctaagatccacagccgcagattcagggat tactgcagccagatgcttagtaaagaagtggatgcttgtgttaccgacctactcaaagaa ctggtccgtttccaagaccgtatgtaccagaaagatccagtcaaggccaagactaaacgt cgacttgtgttggggttgagggaggttctcaaacacctgaagctcaaaaaactgaaatgt gtcattatttctcccaactgtgagaagatacagtcaaaaggtgggctggatgacactttg cacacaattattgattatgcctgtgagcagaacattccctttgtgtttgctctcaaccgc aaagctctggggcgcagtttgaataaggcagttcctgtcagtgtggtggggatcttcagc tatgatggggcccaggatcagttccacaagatggttgagctgacagtggcggcccgacag gcgtacaagaccatgctggagaatgtgcagcaggagctggtgggagagcccaggcctcag gcacctcccagcctacccacacagggccccagctgccctgcagaagatggccccccagcc ctgaaagaaaaagaagagccacactacattgaaatctggaaaaaacatctggaagcatac agtggatgtaccctggagctagaagaatccttggaggcttcaacctctcaaatgatgaat ttgaatttatga >gi568815589f:89211293_89416422|GENSCAN_predicted_peptide_3|1186_aa MCTPIRGLLMALAVMFGTAMAFAPIPRITWEHREVHLVQFHEPDIYNYSALLLSEDKDTL YIGAREAVFAVNALNISEKQHETECLNYIRVLQPLSATSLYVCGTNAFQPACDHLNLTSF KFLGKNEDGKGRCPFDPAHSYTSVMVDGELYSGTSYNFLGSEPIISRNSSHSPLRTEYAI PWLNGKEEQCTLVGEPSFVFADVIRKSPDSPDGEDDRVYFFFTEVSVEYEFVFRVLIPRI ARVCKGDQGGLRTLQKKWTSFLKARLICSRPDSGLVFNVLRDVFVLRSPGLKVPVFYALF TPQLNNVGLSAVCAYNLSTAEEVFSHGKYMQSTTVEQSHTKWVRYNGPVPKPRPGACIDS EARAANYTSSLNLPDKTLQFVKDHPLMDDSVTPIDNRPRLIKKDVNYTQIVVDRTQALDG TVYDVMFVSTDRGALHKAISLEHAVHIIEETQLFQDFEPVQTLLLSSKKEPPFVGTWLSC LDKRKYILPAVGSCAFSLGLSRLQVPPAGPNRTSCPLEVTQLGFAARECFRHLHKPRDAR TVIWDVILNPSHKRGSHMASSAQVSAAAAAGNRFVYAGSNSGVVQAPLAFCGKHGTCEDC VLARDPYCAWSPPTATCVALHQTESPSRGLIQEMSGDASVCPDKSKGSYRQHFFKHGGTA ELKCSQKSNLARVFWKFQNGVLKAESPKYGLMGRKNLLIFNLSEGDSGVYQCLSEERVKN KTVFQVVAKHVLEVKVVPKPVVAPTLSVVQTEGSRIATKVLVASTQGSSPPTPAVQATSS GAITLPPKPAPTGTSCEPKIVINTVPQLHSEKTMYLKSSDNRLLMSLFLFFFVLFLCLFF YNCYKGYLPRQCLKFRSALLIGKKKPKSDFCDREQSLKETLVEPGSFSQQNGEHPKPALD TGYETEQDTITSKVPTDREDSQRIDDLSARDKPFDVKSSGAGPDSSSRVSLLPPFLSDQA QHVHALGNFYLFCQATVLASRKWNYQWKMIRKDPLMALAQASLVPGAGWDGPEKALGHPR GVIVGTMPLQCPGPADIRFVWEKNGRALETCVPVQTHALPDGRAHALSWLQDAIRESAEY RCSVLSSAGNKTSKVQVAVMRPEVTHQERWTRELSAWRAVAGEHDRMMQSWRKAWNTGLY FIKRCSLCLCPVNAQLLQQDTKLDAESQIGQLCKSPGAILNFLTME >gi568815589f:89211293_89416422|GENSCAN_predicted_CDS_3|3561_bp atgtgcacccccattagggggctgctcatggcccttgcagtgatgtttgggacagcgatg gcatttgcacccataccccggatcacctgggagcacagagaggtgcacctggtgcagttt catgagccagacatctacaactactcagccttgctgctgagcgaggacaaggacaccttg tacataggtgcccgggaggcggtcttcgctgtgaacgcactcaacatctccgagaagcag catgagacagagtgcctcaactacatccgggtgctgcagccactcagcgccacttccctt tacgtgtgtgggaccaacgcattccagccggcctgtgaccacctgaacttaacatccttt aagtttctggggaaaaatgaagatggcaaaggaagatgtccctttgacccagcacacagc tacacatccgtcatggttgatggagaactttattcggggacgtcgtataattttttggga agtgaacccatcatctcccgaaattcttcccacagtcctctgaggacagaatatgcaatc ccttggctgaacggtaaggaagagcagtgcaccttagtgggagagcctagtttcgtgttt gctgacgtgatccgaaaaagcccagacagccccgacggcgaggatgacagggtctacttc ttcttcacggaggtgtctgtggagtatgagtttgtgttcagggtgctgatcccacggata gcaagagtgtgcaagggggaccagggcggcctgaggaccttgcagaagaaatggacctcc ttcctgaaagcccgactcatctgctcccggccagacagcggcttggtcttcaatgtgctg cgggatgtcttcgtgctcaggtccccgggcctgaaggtgcctgtgttctatgcactcttc accccacagctgaacaacgtggggctgtcggcagtgtgcgcctacaacctgtccacagcc gaggaggtcttctcccacgggaagtacatgcagagcaccacagtggagcagtcccacacc aagtgggtgcgctataatggcccggtacccaagccgcggcctggagcgtgcatcgacagc gaggcacgggccgccaactacaccagctccttgaatttgccagacaagacgctgcagttc gttaaagaccaccctttgatggatgactcggtaaccccaatagacaacaggcccaggtta atcaagaaagatgtgaactacacccagatcgtggtggaccggacccaggccctggatggg actgtctatgatgtcatgtttgtcagcacagaccggggagctctgcacaaagccatcagc ctcgagcacgctgttcacatcatcgaggagacccagctcttccaggactttgagccagtc cagaccctgctgctgtcttcaaagaaggagcctccattcgtggggacgtggcttagttgc ctggacaagcggaaatacatcctgcctgccgtggggtcctgtgccttctccttgggcctc agtaggctccaggtgccccccgcgggacccaacaggaccagctgtcccttggaagttacc cagcttggctttgctgccagagaatgtttccgccacctccacaagccgcgcgatgcccga actgtgatctgggatgtcatcctcaatccttcccacaaacgtggctcacacatggcctca tctgcccaggtctctgctgctgctgctgctggcaacaggtttgtctatgctggctctaac tcgggcgtggtccaggccccgctggccttctgtgggaagcacggcacctgcgaggactgt gtgctggcgcgggacccctactgcgcctggagcccgcccacagcgacctgcgtggctctg caccagaccgagagccccagcaggggtttgattcaggagatgagcggcgatgcttctgtg tgcccggataaaagtaaaggaagttaccggcagcattttttcaagcacggtggcacagcg gaactgaaatgctcccaaaaatccaacctggcccgggtcttttggaagttccagaatggc gtgttgaaggccgagagccccaagtacggtcttatgggcagaaaaaacttgctcatcttc aacttgtcagaaggagacagtggggtgtaccagtgcctgtcagaggagagggttaagaac aaaacggtcttccaagtggtcgccaagcacgtcctggaagtgaaggtggttccaaagccc gtagtggcccccaccttgtcagttgttcagacagaaggtagtaggattgccaccaaagtg ttggtggcatccacccaagggtcttctcccccaaccccagccgtgcaggccacctcctcc ggggccatcacccttcctcccaagcctgcgcccaccggcacatcctgcgaaccaaagatc gtcatcaacacggtcccccagctccactcggagaaaaccatgtatcttaagtccagcgac aaccgcctcctcatgtccctcttcctcttcttctttgttctcttcctctgcctctttttc tacaactgctataagggatacctgcccagacagtgcttgaaattccgctcggccctacta attgggaagaagaagcccaagtcagatttctgtgaccgtgagcagagcctgaaggagacg ttagtagagccagggagcttctcccagcagaatggggagcaccccaagccagccctggac accggctatgagaccgagcaagacaccatcaccagcaaagtccccacggatagggaggac tcacagaggatcgacgacctttctgccagggacaagccctttgacgtcaagtcctcgggt gcggggcccgacagcagctcgagggtctccttgctgccgcccttcctgagtgaccaggca cagcacgtgcacgccctggggaacttctacctcttctgccaggccacagttctcgcctct aggaagtggaattatcagtggaagatgatacggaaagatccactgatggccctggcacag gccagccttgtccctggggcaggatgggacggtcctgagaaggccctgggacaccctagg ggtgtcatagtgggaactatgcctttgcagtgcccaggtcctgcagacattcgctttgtc tgggagaagaatgggcgagctctggagacctgtgtccctgtgcagacccatgcactgccc gatggcagggcccatgcactcagctggctgcaggacgccatcagggaaagcgctgagtat cgctgctctgtcctctcctcagcagggaacaagacttcgaaggtgcaggttgctgtgatg agacctgaagtgacccaccaggagaggtggaccagagagctctctgcctggagggctgtg gctggggagcacgaccggatgatgcagagctggaggaaggcgtggaatactgggctctat tttatcaagcgctgcagtttatgcctctgtcccgtcaatgctcagcttctgcaacaggac accaaacttgatgcagaaagccaaataggtcaattatgcaaatctcctggtgccatatta aatttcttgacgatggaatga