GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:26:50 Sequence gi568815596f:68874636_69080820 : 206185 bp : 39.87% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7875 7966 92 1 2 114 88 39 0.054 5.29 1.02 Term + 14917 15120 204 0 0 89 46 96 0.072 1.89 1.03 PlyA + 16654 16659 6 -1.75 2.03 PlyA - 16707 16702 6 1.05 2.02 Term - 17690 17530 161 0 2 64 31 131 0.743 2.12 2.01 Init - 21329 20933 397 2 1 78 92 207 0.491 17.01 2.00 Prom - 46264 46225 40 -4.65 3.00 Prom + 47660 47699 40 -4.65 3.01 Init + 49973 50029 57 1 0 95 111 75 0.450 11.86 3.02 Intr + 51050 51191 142 1 1 88 93 98 0.989 9.31 3.03 Intr + 52105 52179 75 2 0 93 81 32 0.729 1.57 3.04 Intr + 54054 54220 167 1 2 97 94 40 0.581 4.26 3.05 Term + 57637 57867 231 0 0 103 40 129 0.413 5.09 3.06 PlyA + 59239 59244 6 -0.45 4.05 PlyA - 59631 59626 6 1.05 4.04 Term - 61383 61198 186 0 0 83 50 111 0.904 3.31 4.03 Intr - 62415 62394 22 2 1 105 98 37 0.894 3.03 4.02 Intr - 65642 65471 172 0 1 78 46 47 0.015 -2.42 4.01 Init - 69572 69161 412 2 1 79 81 333 0.910 28.42 4.00 Prom - 70382 70343 40 -8.65 5.06 PlyA - 70523 70518 6 -0.45 5.05 Term - 70815 70733 83 1 2 99 45 47 0.771 -1.62 5.04 Intr - 71825 71669 157 2 1 44 94 100 0.817 4.86 5.03 Intr - 72622 72512 111 1 0 62 91 108 0.976 8.16 5.02 Intr - 75628 75491 138 1 0 85 77 105 0.980 8.84 5.01 Init - 84466 84440 27 1 0 98 73 19 0.254 1.13 5.00 Prom - 84630 84591 40 -3.85 6.00 Prom + 88937 88976 40 -6.75 6.01 Init + 89142 89204 63 2 0 56 75 72 0.180 3.90 6.02 Intr + 99221 99388 168 1 0 53 77 165 0.515 11.12 6.03 Intr + 103002 103139 138 2 0 60 51 208 0.823 14.04 6.04 Intr + 104236 104346 111 0 0 87 93 42 0.896 4.26 6.05 Intr + 105278 105425 148 1 1 76 113 129 0.895 13.09 6.06 Term + 107108 107115 8 0 2 98 50 0 0.210 -5.75 6.07 PlyA + 107174 107179 6 1.05 7.06 PlyA - 107236 107231 6 1.05 7.05 Term - 110744 110585 160 1 1 21 44 170 0.445 2.33 7.04 Intr - 114416 114265 152 2 2 9 62 158 0.235 3.34 7.03 Intr - 120905 120782 124 1 1 67 105 77 0.609 6.97 7.02 Intr - 136657 136482 176 1 2 45 64 116 0.102 2.72 7.01 Init - 139119 138352 768 0 0 60 26 293 0.137 13.04 7.00 Prom - 140940 140901 40 -6.95 8.00 Prom + 145305 145344 40 -4.55 8.01 Sngl + 148604 148960 357 1 0 41 42 585 0.997 45.01 8.02 PlyA + 149452 149457 6 1.05 9.03 PlyA - 155374 155369 6 1.05 9.02 Term - 158056 157850 207 1 0 45 41 198 0.651 7.16 9.01 Init - 162633 162493 141 0 0 66 63 55 0.295 0.98 9.00 Prom - 163968 163929 40 -5.55 10.00 Prom + 180641 180680 40 -5.85 10.01 Init + 184446 184580 135 2 0 50 64 137 0.065 7.69 10.02 Intr + 197119 197152 34 0 1 83 116 7 0.523 -0.12 10.03 Intr + 198387 198466 80 1 2 68 99 76 0.872 5.05 10.04 Intr + 200955 201023 69 2 0 121 80 88 0.664 9.76 10.05 Intr + 202773 202853 81 2 0 70 89 156 0.997 12.82 10.06 Term + 203895 204083 189 2 0 110 48 87 0.879 3.37 10.07 PlyA + 205761 205766 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 62704 62610 95 1 2 75 14 108 0.872 2.00 S.002 Init + 184446 184584 139 2 1 50 33 148 0.808 5.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_1|98_aa XPSVNPVPSSTKPVNSLTVHTQSTCFHIPRKPFSTQYWTHIPLEPVAAAASCDKWPRIAD AHHSPPSVVERVISDQIPAAATSSPYLGIGDFYEGGWM >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_1|297_bp ngaccctccgtcaaccctgtcccatcatcgaccaagcccgtaaactctcttactgtgcac acacagtccacttgtttccatattccaaggaagcctttcagcacccaatattggacacac attcctttagagcctgtggcggccgctgcctcgtgcgacaagtggccgcgcattgcagat gcacaccacagtccaccaagcgtggtggaacgggttatctcagatcaaattcctgcggca gccacttcctctccttatctgggaatcggggatttttatgagggggggtggatgtaa >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_2|185_aa MDPNQEEIPDLPEKEFRRLVIKLIREGPEKGKAQCKEIQKMIQEVKEEIFKEIDSLKKKQ KVQETLDTLLEMQNALESLSNRTEQVEERNSELKDKVFRLIQSNKDKEKTIRKYEQLGMV AHACNPSTLGGRVLEVLARAVRQEKETKGIQIRKEEVKLSLFADDVIIYLGNPKDSSRKL LELIK >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_2|558_bp atggatccaaaccaagaagaaatccctgatttacctgaaaaagaattcaggaggttagtt attaagctaatcagggagggaccagagaaaggtaaagcccaatgcaaggaaatccaaaaa atgatacaagaagtgaaggaagaaatattcaaggaaatagatagcttaaagaaaaaacaa aaagttcaggaaacgttggacacacttttagaaatgcaaaatgctctggaaagtctcagc aatagaactgaacaagtagaagaaagaaattcagagctcaaagacaaggtcttcagatta atccaatccaacaaagacaaagaaaaaacaataagaaaatatgaacagctgggcatggtg gctcatgcctgtaatccaagcactttgggaggtcgagtactggaagtcttagccagagca gtcagacaagagaaagaaacaaagggcatccaaatccgtaaagaggaagttaaactgtca ctgtttgctgacgacgtgatcatttaccttggaaaccctaaggactcctccagaaagctc ctagaattgataaaataa >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_3|223_aa MDKAVFPSLDDISKALDKQAFKYYPSTRGLTYTVLPSWVKNLAQYGKPIKNMCRDDPTYF AQQQKEGTALAIDSNSCFEIQLLSFMGLFICDTRFPAGTGFLPEMPGSRPAALPMCIITL GCHWLGCDSSLLRKSQHTPLDLRGLVKSREEPAVSKLRTVGMGELARQPQEAAAQCSFSG YLLTLCSQFPFQEIRSSAPYPAWNVLLLLVLLSAWGGETHPIT >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_3|672_bp atggacaaggcagtcttcccaagtttggatgacattagcaaggccctggacaagcaggct tttaagtattacccgtctactcgcggcctgacttacactgttttacccagctgggtcaag aacttagcacagtatggaaagcccattaagaacatgtgcagagatgaccccacctacttt gcccagcagcaaaaagaaggtactgccctggcaattgactccaattcttgttttgaaatc caacttctgtcctttatgggactcttcatctgcgatactcgctttccagctgggactggt ttccttccagaaatgccaggttcaaggcctgctgccctcccgatgtgcatcatcacactt ggctgtcattggctgggctgtgatagctccctcctgagaaagagccagcacacccctcta gaccttagaggacttgtcaagtctagggaagagccagctgtgtccaaactcaggaccgtg ggtatgggtgagctggctcgccagccacaggaggcagctgcccagtgttcattttctggc tacctgctcactctgtgctctcagtttcctttccaagagatcaggagcagtgccccttat cctgcctggaatgtgctgcttctccttgtcctgctgtctgcttggggcggagagacccac cccataacataa >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_4|263_aa MGALLHLSAFRQRGSCLNPPVHTDPNPKGPGEGAVSRSGRPRAVAPECGTGPSASAEEEP ILSENSQRGVSRTGSWAGVEPRRASLHRAGPVSVGPVSLLDLCLKEPNDLEHLTTTESHG HSASDQGSLPSRPRRRPELEKNYSKIHMEQKRAQTAKAILSKQNKTNEQTKNKAEDITLP DFKLHYKATVTKTAGSFAGSGQGINPSAKGDSKCNAACLTTGYTGSDLTSDSPDFNLLCR VLDPQSLSAQGFKVWSNHSPAKQ >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_4|792_bp atgggtgctttgctccacttgagtgcattccggcagcgtgggagctgtttgaatccccca gtgcacacagatcccaaccccaagggtccaggggagggagctgtgagcagatccggacgt cccagggctgtggctccggagtgcggaactgggcccagtgcttcagcagaagaggagccc atactctcagaaaactctcagagaggggtgagtcgcacaggttcctgggctggtgtggaa cctaggcgtgcctccctccacagagctggtccagtaagtgtggggcctgtctccctgctg gacctctgcctgaaggagcccaacgacctggaacacctaacaacaacagaaagtcacggc cacagtgccagtgatcaggggtccctcccctcaagaccgaggaggagacctgaattagaa aaaaactattctaaaattcatatggaacaaaaaagagcccaaacagccaaagcaatccta agcaaacaaaacaaaacaaatgaacaaacaaaaaacaaagctgaagacatcacactacca gacttcaaactacactacaaagctacagtaaccaaaacggcagggtcctttgctggcagt ggacagggcatcaatccttcagcaaagggagacagcaaatgcaatgctgcctgtctgacc acgggatatactggttcagatctcactagtgatagcccagacttcaacttgctttgcagg gtattagatccccaatcccttagtgcccagggattcaaagtttggagtaatcacagccct gccaaacagtag >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_5|171_aa MSIICFELYVFNIISPSNNGGNVQETVTIDNEKNTAIINIHAGSCSSTTIFDYKHGYIAS RVLSRRACFILKMDHQNIPPLNNLQWYIYEKQALDNMFSSKYTWVKYNPLESLIKDVDWF LLGSPIEKLCKHIPLYKGEVVENTHNVGAGGCAKAGLLGILGISICADIHV >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_5|516_bp atgagcatcatctgctttgaactatatgtttttaacatcatcagcccaagcaacaatggt ggcaatgttcaggagacagtgacaattgataatgaaaaaaataccgccatcattaacatc catgcaggatcatgctcttctaccacaatttttgactataaacatggctacattgcatcc agggtgctctcccgaagagcctgctttatcctgaagatggaccatcagaacatccctcct ctgaacaatctccaatggtacatctatgagaaacaggctctggacaacatgttctccagc aaatacacctgggtcaagtacaaccctctggagtctctgatcaaagacgtggattggttc ctgcttgggtcacccattgagaaactctgcaaacatatccctttgtataagggggaagtg gttgaaaacacacataatgtcggtgctggaggctgtgcaaaggctgggctcctgggcatc ttgggaatttcaatctgtgcagacattcatgtttag >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_6|211_aa MTTMLRALTEKVDNMQEDMGNAASEISKRDRVMSIYALQKDNLRVNEENALEYKKLVTRV INCMTVVKVQVRGAVGKNINVNDDNNNAGSGQQSVSVNNEHNVANVDNNNGWDSWNSIWD YGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVNP NKVDDLSKFGKNIANMCRGIPTYMAEEMQVP >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_6|636_bp atgactacaatgctaagagcactaacggaaaaagtagacaacatgcaagaggacatgggt aatgctgccagcgaaatttccaagcgtgatagagtcatgtctatctatgcacttcagaaa gacaacctcagggttaatgaagaaaatgcattggaatataagaaactggtgaccagagtg atcaattgcatgactgttgtgaaagtccaggtgaggggagctgtgggcaagaatatcaac gtcaatgatgacaacaacaatgctggaagtgggcagcagtcagtgagtgtcaacaatgaa cacaatgtggccaatgttgacaataacaacggatgggactcctggaattccatctgggat tatggaaatggctttgctgcaaccagactctttcaaaagaagacatgcattgtgcacaaa atgaacaaggaagtcatgccctccattcaatcccttgatgcactggtcaaggaaaagaag cttcagggtaagggaccaggaggaccacctcccaagggcctgatgtactcagtcaaccca aacaaagtcgatgacctgagcaagttcggaaaaaacattgcaaacatgtgtcgtgggatt ccaacatacatggctgaggagatgcaagtgccatag >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_7|459_aa MRSCPQATLPTSPARGESVFARLAWGGGDNSRHLLVQNEVQVKSAVAGWTPILPASPLPG ADEHQSGQREPLEADAEGSPLRRGHGPQPGESRVRSFPRSANSPRPSGTRHPRALPRGSF IPPRSLATPRKQLSGGVQGFPSGFRSDAVSGFRVSRIPILSSASFKGEGCLGPGPPGSSR GRAELLQWKEFRPDGSLSQILNMMGGGGGFKLFLVGAVLCLLSFLSAPLSRSYGEESSKA ATLWTKAAKARVGPAGLDDLKDSEEIYRNQMGKGKQKHWWELVREEKGYDNVMERVFMGL ARNRKELECGKRVEGHLPHFEVKGVGEGAKLMLKAFLKGSDSPGKGRQDQHEEGTLGTAA AAIQLILAAIEESSRLSETCMVLASCRTESDQEGPSDKNILPNIQTCYRFMDDDRRHETS GSETKDFITRVTASSVSMSMFVSVLVAYKSHRSDDVDQD >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_7|1380_bp atgcgctcctgtccccaggcgaccctgccaacgagcccggcccggggcgaaagcgttttc gcccgcttagcctggggtgggggggacaactcgcggcacttacttgtccaaaatgaagta caggtcaaatccgccgtagcaggctggacccccatcctccctgcgtcccccttgcccggc gcagatgagcaccagagtggccaaagagagccactggaagccgatgccgagggctctccg ctccgccgtggccatggcccgcagcccggggagagcagggtccgctccttcccacgctcc gcgaactcgccacgaccctcagggacgcgccatccgcgggcccttcctcgcgggtccttt attccccctcgctccctcgcaactccccggaagcaattgtctggaggagttcaaggtttc ccctctggtttccgctccgatgctgtttctgggtttcgggtttcaaggatcccaatcctg tcctccgcttcttttaaaggggagggctgcctcggtccggggccgccgggcagctcccga ggccgcgccgagcttttgcaatggaaggagttccgtcccgacggttctttgtcccagatt ttaaatatgatgggggggggggggggtttcaaattgttcttggtgggggcagtgctttgc cttctctccttcctctctgcccccctttcgcgttcttacggggaagaaagttcaaaggct gccaccttgtggacaaaggcggcgaaagcacgcgtgggacccgcgggactcgatgacttg aaagacagtgaggaaatctatagaaatcagatgggtaagggaaagcagaaacactggtgg gagctggtaagagaagaaaagggatatgacaatgtgatggagagagtgtttatgggattg gcaaggaataggaaagagcttgaatgtggaaagcgggtggagggacaccttccccacttt gaagtgaaaggggttggagagggagctaagctaatgctaaaagcctttttaaagggctct gacagccctggaaaggggagacaggaccaacacgaagagggaactctggggacggcagca gctgcaattcagcttattctggcagccatcgaggagtcttccaggctcagtgagacatgt atggttctggcttcatgtagaactgagagtgaccaggaaggcccttcagacaaaaatata ctcccaaatatacagacttgctatcgtttcatggatgatgacagaagacatgagacttct ggatcagagacaaaggactttattactcgtgtcacagcaagcagtgtgagcatgagcatg tttgtgtcagtgctcgttgcctataagtctcatagaagtgatgatgtagaccaagattga >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_8|118_aa MSAETIDGDGDGYGGGGGRDDKDDGDGSGDGDSDSGDYGNVDSCDEDDDGDKDNGDGGGG DSDSDDDNGSSDDSDGGGSDDGDGDMVVVVEVEMMMMMMMMAVVMVVVVLVQKQNMVL >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_8|357_bp atgagtgctgagacgatagatggtgatggtgatggttatggtggtggaggtggacgtgat gataaagatgatggtgatggtagtggtgatggtgacagtgatagtggtgattatggcaat gttgatagttgtgatgaagatgatgatggtgataaggataatggtgatggtggtggtggt gatagtgatagtgatgatgataatggtagtagtgatgatagtgatggtggtggtagtgat gatggtgatggtgatatggtggtggtggtggaggtggagatgatgatgatgatgatgatg atggcagtggtcatggtggtggtggtgttagtgcagaaacagaatatggtcctctaa >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_9|115_aa MKRKSLYCSWKLPDLLPTEPQLFCLGLDKQPGFGTQMALANSRDLREIQHKSTVVKNTDN GLVTQKVFPIIEMHSCGSIPHTRQIHRDEATVATLHPPERSKGDGSTSSGESGCG >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_9|348_bp atgaagaggaaaagcctctattgctcatggaagcttcctgatcttttgccaactgaacct caactattctgccttggcttggataaacagcctgggtttgggactcagatggctctcgcc aattcccgtgacttacgtgaaattcaacacaaaagcactgttgttaagaatacagacaat gggttagtaacccaaaaggtatttcccataattgaaatgcacagctgcggctccattcct cacactcgccagattcacagagatgaagccacagtagccactttgcatcccccagaaaga tcaaagggggatgggtcaacatcgtcgggtgagagtggctgtggatga >gi568815596f:68874636_69080820|GENSCAN_predicted_peptide_10|195_aa MEPTAGEDAMNIVEITRNLEYSINLVDRAATRFERIRSNFERSSMASEQIYYENRQGYRT ASVIIALTDGELHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFPV NDGFQALQGIIHSGLLGAPAEWNAEWEVRSSEPGNPEFHWPTYLTLRDPWTFLVWKQGLM MTYIIGLFEDYKDYW >gi568815596f:68874636_69080820|GENSCAN_predicted_CDS_10|588_bp atggaacctactgctggtgaagatgctatgaacattgttgaaattacaaggaacttagaa tattccataaacttagttgatagagcagcaacaaggtttgagaggattcgctccaatttt gaaagaagttctatggccagtgagcagatttattatgaaaacagacaagggtacaggaca gccagcgtcatcattgctttgactgatggagaactccatgaagatctctttttctattca gagagggaggctaataggtctcgagatcttggtgcaattgtttactgtgttggtgtgaaa gatttcaatgagacacagctggcccggattgcggacagtaaggatcatgtgtttcccgtg aatgacggctttcaggctctgcaaggcatcatccactcaggtttgctaggagctccagca gagtggaatgcagagtgggaagtaagaagctcagagcccggcaacccagagtttcattgg cccacgtacttaacacttagagatccttggacatttcttgtttggaaacagggactgatg atgacttacatcataggcttgtttgaggattacaaggattactggtga