GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:33:38 Sequence gi568815596r:199833070_200055470 : 222401 bp : 39.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 505 544 40 -3.65 1.01 Init + 5456 5700 245 1 2 47 83 211 0.622 13.75 1.02 Term + 6843 6981 139 0 1 77 55 122 0.994 4.25 1.03 PlyA + 7668 7673 6 1.05 2.07 PlyA - 9391 9386 6 1.05 2.06 Term - 11463 11330 134 1 2 69 36 141 0.932 4.27 2.05 Intr - 13101 13006 96 1 0 101 86 63 0.944 6.46 2.04 Intr - 15900 15779 122 2 2 66 58 57 0.140 -0.28 2.03 Intr - 20504 20437 68 2 2 62 76 71 0.230 0.08 2.02 Intr - 27896 27783 114 2 0 110 25 47 0.011 0.32 2.01 Init - 45388 45272 117 1 0 75 87 116 0.310 10.45 2.00 Prom - 46200 46161 40 -5.55 3.00 Prom + 53232 53271 40 -7.25 3.01 Init + 55230 55417 188 2 2 60 40 148 0.368 5.78 3.02 Term + 55888 56602 715 1 1 3 42 263 0.550 5.17 3.03 PlyA + 56652 56657 6 1.05 4.00 Prom + 58319 58358 40 -5.95 4.01 Init + 58611 58932 322 2 1 67 1 170 0.050 3.74 4.02 Intr + 63463 63512 50 2 2 109 81 9 0.052 -0.22 4.03 Intr + 69186 69306 121 2 1 49 73 162 0.211 9.95 4.04 Intr + 78164 79044 881 0 2 28 3 564 0.001 32.01 4.05 Term + 91993 92817 825 0 0 102 50 406 0.156 30.27 4.06 PlyA + 93812 93817 6 1.05 5.05 PlyA - 94181 94176 6 1.05 5.04 Term - 100254 99998 257 1 2 71 36 250 0.945 12.86 5.03 Intr - 102978 102862 117 1 0 72 99 48 0.928 3.82 5.02 Intr - 103423 103336 88 1 1 93 86 43 0.917 3.22 5.01 Init - 111818 111669 150 2 0 62 49 72 0.173 0.79 5.00 Prom - 117071 117032 40 -7.55 6.00 Prom + 119236 119275 40 -7.15 6.01 Init + 122730 123179 450 2 0 80 72 491 0.999 40.66 6.02 Intr + 126199 126270 72 0 0 70 88 95 0.977 6.38 6.03 Intr + 126685 126811 127 0 1 76 85 121 0.985 9.93 6.04 Intr + 128712 128859 148 1 1 66 80 66 0.719 1.97 6.05 Intr + 149470 149523 54 1 0 94 72 46 0.385 0.78 6.06 Intr + 151159 151249 91 1 1 118 74 64 0.893 7.08 6.07 Intr + 153203 153217 15 1 0 109 103 24 0.583 0.82 6.08 Term + 161509 161664 156 0 0 43 39 132 0.071 0.75 6.09 PlyA + 161971 161976 6 1.05 7.07 PlyA - 163560 163555 6 1.05 7.06 Term - 166838 166730 109 1 1 43 50 137 0.124 2.40 7.05 Intr - 176983 176875 109 2 1 63 30 94 0.036 -0.48 7.04 Intr - 185317 185092 226 1 1 -4 57 251 0.517 9.44 7.03 Intr - 190761 190548 214 2 1 42 23 161 0.008 2.80 7.02 Intr - 207493 207442 52 2 1 79 92 43 0.617 0.85 7.01 Init - 213090 212913 178 0 1 71 87 73 0.139 4.97 7.00 Prom - 216364 216325 40 -4.05 8.02 PlyA - 217978 217973 6 1.05 8.01 Term - 222146 221857 290 0 2 87 55 218 0.542 12.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_1|127_aa MCEGDVIRQTQGSKWSSREAPDASHRRLIEGRTSSPTLGGKAKGKLKTRRWVGSRLQQRS EPIPDPHRQAPALPHPGRICKRKLLKDVFHQNDTINQESQKRKREDMGFRLRTRASNMEK DESHDDG >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_1|384_bp atgtgtgagggggatgtgattcggcagactcaaggcagcaagtggagcagcagagaagca cctgatgcatcccacagacgcctgatagagggcagaaccagcagccccactttgggaggg aaggcaaaggggaagctcaaaaccagaagatgggtgggaagtcggcttcagcagcggtca gagcccatccctgatccacacagacaggcacctgccctgccacaccctggtagaatatgt aagagaaagctactaaaggatgtgttccaccaaaacgacaccataaaccaagaaagtcaa aaaagaaaaagagaagacatgggattcaggttgaggaccagggcatccaacatggagaaa gatgaaagtcatgatgatggctga >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_2|216_aa MSSGTMGADAKGCHTDPVPLLAESSCLMQKGRGPLSCQHWFSTKGNFALGRSLAMSGDIF GFSSCMGGGGGNCCWHLGERLTDLTFILALLNQHLGCGLRKIMSSSRVGLRLAACLLNVS EAGRKYIVENIAKAALLDKNGKKHPQVSVLNIFSDQDYKRSVITIATSVDKLGSSVLAAC LEAFQAIDMEVQEGIHPCLGAVDLIPIYPLSGVTVE >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_2|651_bp atgagctcaggtaccatgggtgcagatgcaaaaggctgtcacactgaccctgtgcccttg ctggcagaaagcagctgcctcatgcaaaaaggcagagggccactgagctgtcaacactgg ttctcaaccaagggcaattttgcccttggcaggtctttggcaatgtctggagacattttt ggtttttccagctgcatgggtgggggtggagggaactgctgctggcatctaggtgaacgc ctcactgaccttaccttcatcctggccctgctcaaccaacacctgggctgtgggctcaga aaaatcatgtcttcttccagagtggggctccgtttggctgcctgtttactaaacgtttca gaagccggaagaaaatacattgttgagaacatagcaaaagcagctcttcttgacaaaaat ggaaagaaacatcctcaagtttcagtgctcaatatattttccgatcaagactacaagaga tcagtcattacaatagcaacttctgttgataagttgggcagttctgttctggctgcctgc ctagaggccttccaggctatcgatatggaagttcaagagggaatccacccttgcctggga gcagtggacttgattccgatttaccctctctctggtgtcacagtggaatag >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_3|300_aa MKEKMLSTAREKGQVTHKGLKPIIQQQTSLQKPYKPEESGGQYSTFLEFSTQNFISSQTK LHRLKPTNIKKDKEGHYIMVKGSMQQEELTILNIYALNTRAPRFIKQVLGDLQKDLDSHA IIMGDFNTPLSILDRSMRQKINKDIQDLNSALHQADLIDIYRTLHSKTTEYTFYSAPHCT YSKIDHIIGSKTLLSKCKRTEIITNSLSDHSAITLELRIKKFTQNHTTTWKLKNLLLNDY WVNKKIKAEINKFFETSENKGTAYQNLWDTAKAVFRGKFIGLNAHRRKQERSKIDTLTSQ >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_3|903_bp atgaaggaaaaaatgttaagcacagccagagagaaaggtcaggttacccacaaagggctg aagcccatcatccaacagcagacctctctgcagaaaccctacaagccagaagagagtggg ggccaatattcaacattcttagaattttcaacccagaatttcatatccagtcaaactaag cttcatagacttaaaccaacaaacatcaaaaaagacaaagaaggccattacataatggta aagggatcaatgcaacaagaagagctaactatcctgaatatatatgcactcaatacaaga gcacccagattcataaagcaagttcttggagacctacaaaaagacttagactcccatgca ataatcatgggagactttaacaccccactgtcaatattagacagatcaatgagacagaaa attaacaaggatattcaggacttgaactcagctctgcaccaagcagacctaatagacatc tacagaactctccactccaaaacaacagaatatacattctactcagcaccacattgcact tattctaaaattgaccacataattggaagtaaaacactcctcagcaaatgcaaaagaaca gaaatcataacaaatagtctctcagaccacagtgcaatcacattagaactcaggattaag aaattcactcaaaaccacacaactacatggaaactgaagaacctgctcctgaatgactac tgggtaaataagaaaattaaagcagaaataaataagttctttgaaaccagtgagaacaaa ggcacagcataccagaatctctgggacacagctaaagcagtatttagagggaaatttata ggactaaatgcccacaggagaaagcaggaaagatccaaaatcgacaccctaacttcacaa taa >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_4|732_aa MGKDFMAKTPKAMATKAKTDKWDLIKLKSFCTAKETIIRVNRQPTEWKKIFAMYLPDKGL ISRIYKELKQIYKKKPHQKVGKGYEQTVVKRKYLCGQKTYERKLIITGNSSSKEEICFQS GLTVICISGRAATLVPDTEHEILKYETMTGRELGCLAVEVNVFVYRFRPHPSLPFLTHAA RGRRPLTSSPQRRLPAGPRPPTVEPPAEPPAEVPPSGTPPPPSTSEPLSRRRPMWGFRLL RSPPLLLLLPQLGIGNASSCSQARTMNPGGSGGARCSLSAEVRRRQCLQLSTVPGADPQR SNELLLLAAAGEGLERQDLPGDPAKEEPQPPPQHHVLYFPGDVQVTRGLPGYLFLPLVYT YAVTDSSLVAAAWFLSLPEHTHTFTEGGSSSTPRNPKPCLPRPWRPLSIEPHGEGVRRGL RSRKSQLSVQVSKRRNLRQGVTRILYIYSIIWFLRGPCNYHEIMTRHPENYQWENWSLEN VATILAHRFPNSYIWVIKCSRMHLHKFSCYDNFVKSNMFGAPEHNTDFGAFKHLYMLLVN AFNLSQNSLSKKSLNVWNKDSIASNCRSSPSHTTNGCQGEKVRTCEKSDESAMSFYPPSL NDASFTLIGFSKGCVVLNQLLFELKEAKKDKNIDAFIKSIRTMYWLDGGHSGGSNTWVTY PEVLKEFAQTGIIVHTHVTPYQVRDPMRSWIGKEHKKFVQILGDLGMQVTSQIHFTKEAP SIENHFRVHEVF >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_4|2199_bp atgggcaaagacttcatggctaaaacaccaaaagcaatggcaacaaaagccaaaactgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg aacaggcaacctacagaatggaagaaaatttttgcaatgtatctacctgacaaagggcta atatccagaatctacaaggaacttaaacaaatttacaagaaaaaaccccatcaaaaagtg ggcaaaggatatgaacagacagttgtcaaaagaaaatatttatgtggccaaaaaacttat gaaagaaagctcatcatcactggtaacagttcttcaaaggaagaaatatgttttcagtca ggactcacagttatttgcatatcaggacgtgctgccacattggtacctgatacagaacat gaaattctgaagtatgaaaccatgactggacgagaattagggtgtcttgcggtagaagtg aatgtatttgtataccgatttcgcccacacccctcccttcctttccttacgcacgctgcg cgcggacgtcggcctctgacgtcgtcgcctcagcgccggctcccggccgggccgcggccg ccgaccgttgagccgccggctgagccgcctgctgaagtccctccctcaggaacccctccg ccaccctccacctccgaaccgctctcgcggcggcgacccatgtgggggttcaggctcctg cggtcgccgccgttgctgctcctgctgccgcagctcggaatcggaaacgcctcgtcctgc tctcaggccagaaccatgaacccgggcggcagcggcggcgcgcgatgctccctctcggcc gaggtgcgccgccgtcagtgcctgcagctttccaccgtgcctggagccgatccgcagcgc agcaacgaattgctcctgttggcggcggccggggagggactggagcggcaggacctcccc ggggacccagcgaaggaggagccgcagccgccgccccagcatcacgtcctctatttccct ggggatgtgcaggtaactcggggcctgcctgggtatctgttccttcctctcgtgtatacg tacgcggtcactgattcttccctcgtggctgctgcctggttcctttccttgcctgaacac acccacacattcactgagggtggctcttcctctactccacgaaatccaaaaccgtgcctt ccacgcccttggcgccctttgtccatagaaccccatggggaaggtgtccgacgagggcta cgatctcgaaagagccagctgtcggtccaggtttccaagaggcggaatttgagacaggga gtgacacgcatcttgtatatttattctataatttggtttctcagaggaccgtgtaattac catgaaattatgactcgtcatcctgagaattatcaatgggaaaactggagtctagaaaat gttgctaccattttagcccaccggttccccaatagttatatttgggtgataaaatgttcc cgaatgcatttgcacaaattcagctgctatgacaattttgtgaaaagtaacatgtttggt gccccagaacacaatactgactttggagcttttaagcacctttatatgttattagttaat gcttttaatttaagtcagaatagtttatcaaagaaaagtttgaatgtttggaataaggac tccatagcatctaactgtagatccagtccttctcatactacgaatggttgccagggagaa aaagtgaggacctgtgaaaaatctgatgagtctgccatgagtttttatccaccatcacta aatgacgcatcttttactttgattggattcagtaaaggttgtgttgttttgaatcagttg ctttttgaattgaaagaagccaagaaagacaagaacatagatgcttttatcaaaagcata agaacaatgtattggctggatggtggtcattctggaggaagcaatacttgggttacttat ccagaagtcttgaaagaatttgcacaaacaggaattatcgttcacactcatgtaacacct taccaagtacgtgatccaatgagatcttggattggaaaggagcacaagaaatttgttcag atacttggggatcttggtatgcaggtgactagccaaattcattttacaaaggaagctcct tccatagagaatcacttcagggttcatgaagtattttga >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_5|203_aa MFVFTQYISGKTVQGESKLAQGHLLIEKEKVFNLFFRLSQYRVQVGRYLVVMDNLLIQVT GKKRVVLFSPRDAQYLYLKGTKSEVLNIDNPDLAKYPLFSKARRYECSLEAGDVLFIPAL WFHNVISEEFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRAAQILDRALKTLAELPEE YRDFYARRMVLHIQDKAYSKNSE >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_5|612_bp atgtttgtgtttacccagtatatttctgggaaaacagtgcaaggagaaagtaagctggcc caaggtcatcttctcatagagaaggagaaagtatttaaccttttctttcgactttctcag tacagagttcaagtgggaagatatttagtagtaatggataatttgttaatacaagtgaca ggaaaaaagcgtgttgtactcttcagtcctcgagatgcccagtatttatatttaaaaggt actaaatcagaagtactgaatatagataacccagacttggctaaatatccacttttttcc aaggctagaagatatgaatgttcccttgaagctggtgatgtattattcattcctgcttta tggttccataatgtaatttctgaagagtttggagtgggagtgaatatcttttggaagcac cttccatctgaatgctatgataaaacagatacctatggaaacaaagatcctacagcagca tcaagagctgcacaaattctggacagagccttgaaaacactggccgagttaccagaggaa tatagggacttctatgcacgacgaatggtcctacacattcaagacaaagcctacagcaag aactctgagtaa >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_6|370_aa MALAARLLPQFLHSRSLPCGAVRLRTPAVAEVRLPSATLCYFCRCRLGLGAALFPRSARA LAASALPAQGSRWPVLSSPGLPAAFASFPACPQRSYSTEEKPQQHQKTKMIVLGFSNPIN WVRTRIKAFLIWAYFDKEFSITEFSEGAKQAFAHVSKLLSQCKFDLLEELVAKEVLHALK EKVTSLPDNHKNALAANIDEIVFTSTGDISIYYDEKGRKFVNILMCFWYLTSANIPSETL RGASVFQVKLGNQNVETKQLLSASYEEVGPPYGDRTRLCQTAPGYCHIPHARSLTWCVGP GDVKHLWTSNGISMTERDHAESTTKTADPWNFTAFPGIITATATAYSKREIAFDELCKDK LRRDCHPASR >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_6|1113_bp atggcgctggccgctcgtttgctaccccagttcctgcactctcggtcgctgccctgcggg gccgtccgactccggactcctgctgtggccgaggtgaggctgccgtcggccacactttgc tacttctgccgctgtcgcctcggcttgggagcggcgttatttccacgaagcgctagggcc ttggcagcctcggcgctacctgcccagggctcccggtggccagtgctcagcagcccggga ctccccgcagccttcgcttctttccctgcctgccctcagcgcagctacagcacggaggag aagccccagcagcaccagaaaaccaagatgatcgtcctgggattctccaaccccatcaac tgggttaggactcgaattaaggccttccttatctgggcctatttcgacaaagagttcagc atcacagagttctccgagggagcgaagcaggcttttgctcatgtatccaagttgctgtca cagtgtaaatttgatctgttggaagaacttgtggccaaagaggtgctacatgcattgaaa gaaaaggttacttcactacctgacaaccataaaaatgcccttgctgctaacatagatgaa attgtatttacatcaacaggagacatctccatttactatgatgagaaaggaaggaagttt gttaacatcctgatgtgcttttggtatctaaccagtgccaacatccccagtgaaacttta agaggagccagtgtattccaggttaagttggggaatcagaatgtggaaactaaacaactt cttagtgcaagctatgaggaagtgggaccaccctatggggacaggaccagactgtgccag actgcgccagggtactgccatattccacatgcccgttccctcacgtggtgtgtaggtcct ggggatgttaaacacttatggacctcaaatggcatcagcatgactgaacgtgatcatgca gagtcaactaccaaaactgctgatccatggaacttcacagcctttccgggcataataacg gctacagctacagcttattcaaagagagaaatagcctttgatgagctgtgcaaggacaaa ttaagacgtgactgtcacccagcatctagatag >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_7|295_aa MGKMSPGHFRDLQGNPSHHRPEGLEGKNGGPGPYCFVQIWDMAPCIPATPAPATAKRGQA SVFTKEKKYLLHSVIVRCFDHVLFTALVKEVFQDTGHHTDKIDSSLFITDTDSLGEDIAQ HTGPHRDCTWDQGEQPGAMGGKLGSIKRSGVPENVEATSELGNKQRLEQFGRLRRRQENV GKFETPRDVLNGFDQNADNDMDNEKQADVVSDGDEELVGNWSKGCHKAAIKVLAMAAVSA KDLTGEEFTSKLMQFLAGFRFPDQEELHLKSETSSAVGPDLENEIMAIKSAAIIR >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_7|888_bp atggggaaaatgtctccagggcatttcagagatcttcaaggcaacccctctcatcacagg cctgaaggcctagaagggaaaaatggtggcccagggccttactgctttgtgcagatttgg gatatggcaccctgcataccagccactccagctccagccacagctaaaaggggccaagct tcggtcttcaccaaagaaaagaaatacctacttcatagtgtcattgtgaggtgctttgat catgtcctattcaccgcattggtgaaggaggtatttcaggacactggacaccacacagat aagattgacagcagtttattcatcactgatactgatagtctgggagaggacattgctcaa cacacagggccacacagggactgcacttgggaccaaggtgaacaaccaggggccatggga ggcaagcttggcagtatcaagaggagtggggtacctgaaaatgtggaagcaacttcggaa ctgggtaacaaacagaggttggaacagtttggaaggctcagaagaagacaggaaaatgtg ggaaagtttgaaactcccagagatgtgttgaatggctttgaccaaaatgctgataatgat atggacaatgaaaagcaggctgacgtggtctcagatggagatgaggaacttgttgggaac tggagcaaagggtgtcacaaagcagcaattaaggtgttagccatggctgctgtctcagct aaagacttgactggagaagaatttacttccaaactcatgcagtttttagcaggattcagg tttccagatcaagaggagttgcacttgaagagtgaaacctcatctgcagttggacctgat ttggagaatgaaatcatggccatcaagtctgctgccataataagatga >gi568815596r:199833070_200055470|GENSCAN_predicted_peptide_8|96_aa XRRQQERGMRDAKQQKCWPICSLRELRPREVATTREPRLRVPGVSGLEALLSEEYRGVGT HLENSLAAFLQVCCPLLGVHVVPNHCAPSLAQGQQE >gi568815596r:199833070_200055470|GENSCAN_predicted_CDS_8|291_bp naccgaaggcaacaggagcgagggatgagggatgccaaacaacaaaaatgttggcctata tgctccctccgggagctccgtcccagggaagtagccactacccgagagcccaggctgcga gtgcctggagtctcaggtctggaggccctgctcagtgaggagtatcgaggggtgggaacc cacctggaaaatagtctggccgctttcctgcaagtctgctgccctcttctgggggtccac gttgtccctaatcactgtgctccctccctggcccaagggcaacaggaatga