GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:15:25 Sequence gi568815597f:51136757_51371319 : 234563 bp : 44.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1517 1512 6 1.05 1.02 Term - 21379 21215 165 1 0 49 52 310 0.944 21.42 1.01 Init - 21715 21419 297 1 0 108 -3 658 0.997 55.92 1.00 Prom - 28606 28567 40 -3.26 2.00 Prom + 36302 36341 40 -3.46 2.01 Init + 47622 47630 9 2 0 76 97 14 0.362 0.79 2.02 Term + 93164 93355 192 1 0 95 40 181 0.627 11.42 2.03 PlyA + 93557 93562 6 1.05 3.03 PlyA - 94034 94029 6 1.05 3.02 Term - 96605 96557 49 1 1 121 48 25 0.273 -1.42 3.01 Init - 99757 99504 254 1 2 100 40 258 0.862 17.01 3.00 Prom - 112305 112266 40 -3.86 4.02 PlyA - 113383 113378 6 1.05 4.01 Sngl - 114669 113842 828 0 0 63 46 1106 0.999 99.64 4.00 Prom - 136792 136753 40 -3.36 5.22 PlyA - 136810 136805 6 1.05 5.21 Term - 151524 151401 124 2 1 83 33 74 0.800 -0.74 5.20 Intr - 152199 152083 117 2 0 76 80 173 0.861 14.88 5.19 Intr - 153363 153249 115 1 1 79 108 106 0.999 11.21 5.18 Intr - 153869 153758 112 2 1 78 62 199 0.673 16.25 5.17 Intr - 157755 157635 121 2 1 124 68 181 0.945 20.20 5.16 Intr - 159414 159323 92 0 2 107 72 156 0.990 14.69 5.15 Intr - 164977 164816 162 1 0 2 86 307 0.954 22.07 5.14 Intr - 165660 165601 60 0 0 75 73 49 0.648 1.03 5.13 Intr - 165817 165750 68 2 2 110 105 179 0.853 20.42 5.12 Intr - 166436 166328 109 2 1 123 86 147 0.999 17.86 5.11 Intr - 169320 169221 100 2 1 66 105 117 0.710 11.31 5.10 Intr - 172569 172505 65 0 2 105 89 114 0.615 10.72 5.09 Intr - 174565 174498 68 2 2 96 91 80 0.996 7.72 5.08 Intr - 175439 175363 77 1 2 129 70 73 0.999 8.76 5.07 Intr - 176187 176056 132 2 0 92 31 203 0.986 14.76 5.06 Intr - 185069 184965 105 1 0 122 94 173 0.982 20.63 5.05 Intr - 189488 189362 127 0 1 74 42 45 0.007 -1.76 5.04 Intr - 198972 198892 81 1 0 111 80 38 0.002 4.91 5.03 Intr - 208040 207954 87 0 0 93 89 38 0.042 4.34 5.02 Intr - 208368 208126 243 1 0 66 61 107 0.495 3.27 5.01 Init - 215881 215875 7 1 1 81 87 0 0.124 0.29 5.00 Prom - 218352 218313 40 -4.96 6.04 PlyA - 218505 218500 6 1.05 6.03 Term - 220090 219944 147 1 0 29 49 171 0.976 5.20 6.02 Intr - 224537 224415 123 2 0 47 87 58 0.411 2.38 6.01 Intr - 229273 229197 77 2 2 81 84 46 0.672 2.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 48962 49033 72 1 0 80 39 88 0.894 1.01 S.002 Term + 134395 134566 172 1 1 106 52 110 0.990 6.50 S.003 Init + 193905 193976 72 2 0 97 94 99 0.862 10.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:51136757_51371319|GENSCAN_predicted_peptide_1|153_aa MASGVAVSDGVIKVFKDMKMHKSSMPEEVKKCKKVVFFCLSEDKKNIILEEGKEILVGDV GQTVNNLYATFVKMLPYKDYRYTLYDTTYETKESKEEDLSKIIYASSKDAIKKKLTGIKH ALQANCYEDVKDHCTLAEMLGGSAIISLEGKPL >gi568815597f:51136757_51371319|GENSCAN_predicted_CDS_1|462_bp atggcctcaggggtggccgtctctgatggtgtcatcaaggtgttcaaggacatgaagatg cataagtcttcaatgccagaggaggtgaagaagtgcaagaaggtggtgttcttctgcctg agtgaggacaagaagaacattatcctggaggagggcaaggagatcctggtgggcgatgtg ggccagactgtcaacaacctctatgccacctttgtcaagatgctgccatataaggactac cgctacaccctctatgacacaacctacgagaccaaggagagcaaggaggaggacctgagc aaaataatctatgccagctccaaggacgccatcaagaagaagctgacagggatcaagcat gcattacaagcaaactgctacgaggatgtcaaggaccactgcaccctggcagagatgctg gggggcagtgccatcatctccctggagggcaagcctttgtga >gi568815597f:51136757_51371319|GENSCAN_predicted_peptide_2|66_aa MVKNRSCKLQMVLQMEPQMQPMTKIYWGPLDWPASPCSDVDDIEGNPPEEISTAQPLLHP NSAVSS >gi568815597f:51136757_51371319|GENSCAN_predicted_CDS_2|201_bp atggtgaagaatcgaagctgtaaactacaaatggttcttcaaatggagccccagatgcag cccatgactaagatctactggggtcccctggactggcctgctagcccatgctccgatgtt gatgacatcgaaggcaaccctccagaggaaatctcaactgcacaacccctacttcacccc aattcagcagtaagcagttag >gi568815597f:51136757_51371319|GENSCAN_predicted_peptide_3|100_aa MGGGGRGDEALPHPAPPRGGGGHGAAGPNPGQRGGARPSRLPQLLPQLLLPPPTPEPRPR ASSYSASARRRQPERGRLGHCVIAPWVTLTQALEKFYSAL >gi568815597f:51136757_51371319|GENSCAN_predicted_CDS_3|303_bp atgggggggggggggcggggagacgaagcacttccccaccccgcccccccgcggggcggc ggcggccacggagctgctgggcctaatccaggtcagcgggggggggcgcggccgtcgcgg ctgcctcagctgctgcctcagctgctgctgcccccgccgacgccggagccacggccgcgg gcctcatcctactccgcctcagcgagacggcggcagccagaacgaggccggctcgggcac tgcgtcatcgcgccgtgggtaacactgacacaagcccttgaaaagttctatagtgccttg tga >gi568815597f:51136757_51371319|GENSCAN_predicted_peptide_4|275_aa MGNCGGFRGGFGSGIRGRGRSRRRGRGRGRGARGGKAEDKEWMPVTKLGCLVKDMKIKSL EEIYLFSLPIKESEIIDFFLGASLKDEVLKIMPVQTQTRAGQRTRFKAFVAIGDYNGHVG LGVKCSKEVATAIHGAIILAKLSIVPVRRGYWGNKIGKPHTVPCKVTGRCGSALVHLIPV PRGTGIVSAPVPKKLLMMAGIDDCCTSAWGCTATLGNFAKATFDAISKTYSYLTPDLWKE TVFTKSPDQEFTDHLIKAHARVSVQRTQAPAVATT >gi568815597f:51136757_51371319|GENSCAN_predicted_CDS_4|828_bp atggggaactgcggtggcttccgcggaggtttcggcagtggcatccggggccggggtcgc agccgtagacggggccggggccgaggccgcggagctcgcggaggcaaggccgaggataag gagtggatgcccgtcaccaagctgggctgcttggtgaaggacatgaagatcaagtccctg gaggagatctatctcttctccctgcccattaaggaatcagagatcattgactttttcctg ggggcctctctcaaggatgaggttttgaagattatgccagtgcagacgcagacccgtgct ggccagcgcaccaggttcaaggcgtttgttgctatcggggactacaatggccacgtcggt ctgggtgttaagtgctccaaggaggtggccaccgccatccatggggccatcatcctggcc aagctctccattgtccccgtgcgcagaggctactgggggaacaagattggcaagccccac accgtcccttgcaaggtgacaggccgctgcggctctgcactggtgcacctcatccctgta cccaggggcactggcattgtctccgcacctgtgcccaagaagctgctcatgatggctggt atcgatgactgctgcacctcagcctggggctgcactgccaccctgggcaacttcgccaag gccacctttgatgccatttctaagacctacagctacctgacccccgacctctggaaggag actgtatttaccaagtctcccgatcaggaattcactgaccacctcatcaaggcccacgcc agagtctccgtgcagcggacccaggctccagctgtggctacaacatag >gi568815597f:51136757_51371319|GENSCAN_predicted_peptide_5|723_aa MKGAATVASTAAAAAARGRPRPRPSPARAHGLSDHLEGGGLREKRPQVPGRGGDSAEGAD GTKGPPEDPADVGETAERHSWGKAGRELGRVDAVPPERRSVSQDAGYLRSQLENKESRPG FELRLDVGPSLYFLICRMGASSVPQAELEVSSGPLWPPVIPMITALSTVAVTALSASVTC RLTPESSLHEALDQCMTALDLFLTNQFSEALSYLKPRTKESMYHSLTYATILEMQAMMTF DPQDILLAGNMMKEAQMLCQRHRRKSSVTDSFSSLVNRPTLGQFTEEEIHAEVCYAECLL QRAALTFLQDENMVSFIKGGIKVRNSYQTYKELDSLVQSSQYCKGENHPHFEGGVKLGVG AFNLDYGLLQLEEGASGHSFRSVLCVMLLLCYHTFLTFVLGTGNVNIEEAEKLLKPYLNR YPKGAIFLFFAGRIEVIKGNIDAAIRRFEECCEAQQHWKQFHHMCYWELMWCFTYKGQWK MSYFYADLLSKENCWSKATYIYMKAAYLSMFGKEDHKPFGDDEVELFRAVPGLKLKIAGK SLPTEKFAIRKSRRYFSSNPISLPVPALEMMYIWNGYAVIGKQPKLTDGILEIITKAEEM LEKGPENEYSVDDECLVKLLKGLCLKYLGRVQEAEENFRSISANEKKIKYDHYLIPNALL ELALLLMEQDRNEEAIKLLESAKQNYKNYSMESRTHFRIQAATLQAKSSLENSSRSMVSS VSL >gi568815597f:51136757_51371319|GENSCAN_predicted_CDS_5|2172_bp atgaaaggtgctgccactgtagcctccacggccgccgccgccgccgcgaggggccgcccg cggccccgtccgagcccggcccgcgcccatggcctttctgaccacctggaaggaggcggc ctccgggaaaagcgaccgcaggtaccgggcaggggtggggactcggcggaaggcgcggac gggaccaaggggccgccggaggacccagccgacgtcggggaaacagctgagaggcacagc tggggaaaggcagggcgggaactggggagggtagacgcggtcccacctgagcgccgctca gtgtcgcaagacgcgggctatctgcgaagtcagctcgagaataaggaatcaagacctggg ttcgagctacggctagatgttggtccaagcctctattttctcatctgtaggatgggagcc tcctcagtgccccaggctgagttagaggtgtcttctgggcccctttggcccccagtgatt cccatgatcacagccctgagcactgtagcggtcactgccctgtctgcatctgtcacctgt aggctgactcctgagagcagcctccatgaggccctggaccagtgcatgaccgccctggac ctcttcctcaccaaccagttctcagaagcactcagctacctcaagcccagaaccaaggaa agcatgtaccactcactgacatatgccaccatcctggagatgcaggccatgatgaccttt gaccctcaggacatcctgcttgccggcaacatgatgaaggaggcacagatgctgtgtcag aggcaccggaggaagtcttctgtaacagattccttcagcagcctggtgaaccgccccacg ctgggccaattcactgaagaggaaatccacgctgaggtctgctatgcagagtgcctgctg cagcgagcagccctgaccttcctgcaggacgagaacatggtgagcttcatcaaaggcggc atcaaagttcgaaacagctaccagacctacaaggagctggacagccttgttcagtcctca caatactgcaagggtgagaaccacccgcactttgaaggaggagtgaagcttggtgtaggg gccttcaacctggactatgggctgctgcagctggaggagggagcgtcagggcacagcttc cgctctgtgctctgtgtcatgctcctgctgtgctaccacaccttcctcaccttcgtgctc ggtactgggaacgtcaacatcgaggaggccgagaagctcttgaagccctacctgaaccgg taccctaagggtgccatcttcctgttctttgcagggaggattgaagtcattaaaggcaac attgatgcagccatccggcgtttcgaggagtgctgtgaggcccagcagcactggaagcag ttccaccacatgtgctactgggagctgatgtggtgcttcacctacaagggccagtggaag atgtcctacttctacgccgacctgctcagcaaggagaactgctggtccaaggccacctac atttacatgaaggccgcctacctcagcatgtttgggaaggaggaccacaagccgttcggg gacgacgaagtggaattatttcgagctgtgccaggcctgaagctcaagattgctgggaaa tctctacccacagagaagtttgccatccggaagtcccggcgctacttctcctccaaccct atctcgctgccagtgcctgctctggaaatgatgtacatctggaacggctacgccgtgatt gggaagcagccgaaactcacggatgggatacttgagattatcactaaggctgaagagatg ctggagaaaggcccagagaacgagtactcagtggatgacgagtgcttggtgaaattgttg aaaggcctgtgtctgaaatacctgggccgtgtccaggaggccgaggagaattttaggagc atctctgccaatgaaaagaagattaaatatgaccactacttgatcccaaacgccctgctg gagctggccctgctgcttatggagcaagacagaaacgaagaggccatcaaacttttggaa tctgccaagcaaaactacaagaattactccatggagtcaaggacacactttcgaatccag gcagccacactccaagccaagtcttccctagagaacagcagcagatccatggtctcatca gtgtccttgtag >gi568815597f:51136757_51371319|GENSCAN_predicted_peptide_6|115_aa XTDPFASVFGNESFGGGFADFSTLSKPFPGNDSPKEKDPEIFCDPFTSATTTTNKEADPS NFANFSAYPSEEDMIEWAKRESEREEEQRLARLNQQEQEDLELAIALSKSEISEA >gi568815597f:51136757_51371319|GENSCAN_predicted_CDS_6|348_bp nccacagacccctttgcttctgtttttgggaatgaatcatttggaggtggatttgctgac ttcagcacattgtcaaagcctttcccaggcaacgatagccccaaagaaaaagatcctgaa atattttgtgatccattcacttctgctactaccactaccaataaagaggctgatccaagc aattttgccaacttcagtgcttatccctctgaagaagatatgatcgaatgggccaagagg gaaagtgagagagaggaagagcagaggcttgcccgactaaatcagcaggaacaagaagac ttagaactggctattgcactcagcaaatctgagatatcagaagcatga