GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:02:47 Sequence gi568815589f:96350280_96590035 : 239756 bp : 43.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 2026 2021 6 1.05 1.04 Term - 3601 3515 87 1 0 86 49 64 0.059 -0.04 1.03 Intr - 14271 14185 87 0 0 107 115 17 0.169 6.47 1.02 Intr - 18026 17993 34 1 1 91 109 7 0.104 1.33 1.01 Init - 33355 33198 158 1 2 70 76 418 0.629 36.18 1.00 Prom - 35597 35558 40 -4.46 2.05 PlyA - 35686 35681 6 1.05 2.04 Term - 38180 37958 223 1 1 78 33 299 0.999 19.79 2.03 Intr - 42279 42119 161 0 2 32 55 152 0.025 5.09 2.02 Intr - 44663 44544 120 2 0 72 64 91 0.025 5.79 2.01 Init - 67753 67334 420 1 0 67 105 737 0.908 69.69 2.00 Prom - 77530 77491 40 -3.76 3.00 Prom + 81832 81871 40 -3.86 3.01 Init + 100001 100349 349 1 1 92 75 649 0.977 59.35 3.02 Intr + 108100 108262 163 2 1 84 53 128 0.899 7.93 3.03 Intr + 115058 115219 162 2 0 90 101 27 0.751 3.29 3.04 Intr + 115431 115499 69 0 0 92 96 26 0.656 2.10 3.05 Intr + 134183 134354 172 2 1 59 69 175 0.417 12.65 3.06 Intr + 137720 137995 276 1 0 61 66 122 0.652 5.01 3.07 Intr + 138539 138708 170 1 2 106 36 128 0.418 8.14 3.08 Intr + 141472 141622 151 1 1 52 19 107 0.130 0.36 3.09 Intr + 142741 142852 112 0 1 70 49 34 0.039 -2.35 3.10 Intr + 146378 146592 215 0 2 28 46 186 0.568 6.83 3.11 Term + 154077 154280 204 2 0 114 40 119 0.818 7.07 3.12 PlyA + 156250 156255 6 1.05 4.13 PlyA - 157138 157133 6 1.05 4.12 Term - 158570 158309 262 1 1 96 37 50 0.211 -4.30 4.11 Intr - 159510 159394 117 2 0 72 100 21 0.205 1.28 4.10 Intr - 163321 163240 82 2 1 70 92 120 0.494 9.30 4.09 Intr - 164798 164791 8 1 2 126 98 0 0.413 -1.82 4.08 Intr - 172324 172227 98 1 2 49 94 66 0.367 2.11 4.07 Intr - 173141 172982 160 1 1 39 46 83 0.444 -0.81 4.06 Intr - 173446 173308 139 2 1 90 95 99 0.641 10.32 4.05 Intr - 183834 183648 187 0 1 17 95 224 0.365 15.26 4.04 Intr - 188861 188799 63 2 0 59 100 53 0.372 2.51 4.03 Intr - 214573 214498 76 0 1 45 98 56 0.060 1.82 4.02 Intr - 215204 215114 91 0 1 47 100 10 0.036 -2.85 4.01 Init - 216537 216489 49 0 1 80 105 44 0.117 5.03 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 42257 42119 139 0 1 37 55 165 0.965 7.62 S.002 Init + 209405 209470 66 1 0 73 106 35 0.898 4.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:96350280_96590035|GENSCAN_predicted_peptide_1|121_aa MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLIGRSGSHLEAGGDGLAFGSPGSLLTM R >gi568815589f:96350280_96590035|GENSCAN_predicted_CDS_1|366_bp atgacggccggcggccaggccgaggccgagggcgctggcggggagcccggcgcggcgcgg ctgccctcgcgggtggcccggctgctgtcggcgctcttctacgggacctgctccttcctc atcgtgcttgtcaacaaggcgctgctgaccacctacggtttcccgtcaccaattttcctt ggaattggacagatggcagccaccataatgatactatatgtgtccaagctaaacaaaatc attcacttccctgattttgataagaaaattcctgtaaagctcatcggccgctcagggtcg catctagaagcaggtggagatgggctggcatttggcagccctggcagtctgttgactatg cgctga >gi568815589f:96350280_96590035|GENSCAN_predicted_peptide_2|307_aa MIRGFEAPMAENPPPPPPPVIFCHDSPKRVLVSVIRTTPIKPTCGGGGEPEPPPPLIPTS PGFSDFMVYPWRWGENAHNVTLSPGAAGAAASAALPAAAAAEHSGLRGRGAPPPAASASA AASGGEDEEEASSPDSGHLKVRGPICVTIQTVEKPLFKVDSSKHISVFTPERNLLFVQKM VLTLCFEGCLSRFTHANRHCPKHPYARLKREEPTDTLSKHQAADNKAAAEWLARYWEMRE QRTPTLKGKLVQKADQEQQDPLEYLQSDEEDDEKRGAQRRLQEQRERLHGALALIELANL TGAPLRQ >gi568815589f:96350280_96590035|GENSCAN_predicted_CDS_2|924_bp atgatccggggcttcgaggcgcccatggcggagaacccgccgccgccgccgccgcccgtc atcttctgccacgactccccgaagcgggtgctggtgtcggtcatcaggacgaccccgatc aagccaacgtgcggcggtggaggggagccggagccgccgccgccgctcatccccaccagc cccggcttcagcgacttcatggtgtacccgtggcgctggggcgagaacgcacacaacgtg acgctcagccctggggccgcgggggccgccgcctcggccgccctgcctgcagccgcagcc gccgagcactcggggcttcgtggccggggcgcgcccccgcccgccgcctcggcctccgcc gccgcctcgggaggtgaggacgaggaggaagcgagcagcccagacagcggccacctcaag gtgagaggccctatctgtgtgactatccagactgtggaaaagcctttgttcaaagtggac agctcaaaacacatcagcgtcttcacaccggagagaaaccttttgtttgttcagaaaatg gtgttaaccttgtgttttgaaggctgcctgagcagattcacccatgcaaaccgccactgt ccgaagcacccctacgccaggctgaagagagaggagcccacggacacactcagcaaacat caggctgccgacaacaaggccgcggccgagtggctggcgaggtattgggaaatgagagag cagcgcacccccactttgaaaggcaagctggttcagaaggctgatcaggagcagcaggac cctctggaataccttcagtctgatgaagaggacgacgagaagagaggggcccagcgccgg ctgcaggagcagcgggagcgcctgcatggagccctcgcgctcatagagcttgccaacctg actggggcgccactccgacagtag >gi568815589f:96350280_96590035|GENSCAN_predicted_peptide_3|680_aa MKGALGSPVAAAGAAMQESFGCVVANRFHQLLDDESDPFDILREAERRRQQQLQRKRRDE AAAAAGAGPRGGRSPAGASGHRAGAGGRRESQKERKSLPAPVAQRPDSPGGGLQAPGQKR TPRRGEQQGWNDSRGPEGMLERAERRSYREYRPYETERQADFTAEKFPDEKPGDRFDRDR PLRGRGGPRGGMRGRGRGGPGNRVFDAFDQRGKREFERYGGNDKIAVRTEDNMGGCGVRT WGSGKDTRVPELEVEEETQVQEMTLDEWKNLQEQTRPKPEFNIRKPESTVPSKAVVIHKS KYRDDLLCSTLQPLSQMSVGMAVDLCVFDAVIVFQMVKDDYEDDSHVFRKPANDITSQLE INFGNLPRPGRGARGGTRGGRGRIRRAENYGPRAEVVVSLITSPRPIKVSEQRSFFSLKP DLRVRLYSHWCGYRSAAAFAHYRFTQCHAVPGMYCKNTDSSYPIELLSHVVEDSFGEPGA RSEPVMGYVCLPGLALADLVVCLDTSDHFLLLISKSMPFQHMGASREGDCCPLKLYCAVL AGAGKGDPAGSTAQATLKSPLEEVQRTRSRGARPAPSLREAPSSSSFLCPPAREHRGGFS VLCTAQDKRHWKRFTTLHFQVSNNFQKANKTKIRVRKTTSNLEGSSPDAAAAALFKEKRL NRSFRFQRSQQSTESKIQTS >gi568815589f:96350280_96590035|GENSCAN_predicted_CDS_3|2043_bp atgaagggcgctctggggagtcccgtggctgccgctggcgccgcgatgcaggagagtttc ggctgcgtggtggccaaccgcttccatcagctgctggacgacgagtcggacccgttcgac atcctgcgcgaggccgagcgccggcgccagcagcagctgcagcgcaagaggcgcgacgag gcggcggcggcggccggggccggtccccgcggcggcaggagcccagccggggcctcgggc cacagagccggcgcgggcggccggagggagtcgcagaaggagcgcaagagcctcccggcg cccgtcgctcagcggcccgatagccccgggggcggcctgcaggcgccgggccagaagcgg actcctagaagaggggagcagcaaggatggaatgacagccgtgggccggaggggatgctc gaaagagctgagcggagatcctacagggaataccgaccctatgagacagagaggcaggca gacttcacagctgagaagtttccagatgaaaaaccaggtgataggtttgatcgagacaga ccgttgagaggacgtggaggcccgagagggggtatgcgcggcagaggcagaggtggccct gggaacagagtttttgacgcttttgaccagagaggaaagcgagaatttgaaagatatggt gggaatgacaaaatagcagtcagaactgaagacaacatgggtggatgtggagttcgaacc tggggatcgggtaaagataccagagttcctgagttggaggtagaagaagaaacccaagtt caagagatgactttagatgagtggaaaaatcttcaagaacagaccagaccaaagcctgag tttaacatccggaaaccagaatccactgttccttccaaagccgtggtgattcacaagtca aaatacagagatgatctcctgtgtagcacactgcagccactaagccagatgagtgtgggg atggctgtggacttgtgcgtgtttgatgctgtaattgtgtttcagatggtaaaagatgac tatgaggacgattcccatgttttccggaaacccgccaatgacatcacatcccagctggag attaattttggtaacctccctcgtcctgggcgtggagccagaggaggcacccggggaggc cggggaaggatcaggagggcagagaactatggacccagagcagaagtggtggtttctttg atcacatctcctagaccgatcaaggtctcggagcaaaggagtttcttttcattgaagcct gatctccgtgtgcggctgtattctcactggtgtggctatcggtcagcggctgcttttgct cattaccggtttacgcagtgccacgcagtgcctggcatgtattgcaaaaacactgacagt tcctatcccatagagctgttgagtcatgtggttgaagacagttttggggaacccggtgca cgttctgaacctgtaatgggctatgtttgcttaccagggctagctctcgctgacttggtc gtgtgcctggatacttcagaccatttccttctgttgatcagcaagtccatgccattccag cacatgggagcttcacgggaaggagactgctgtcctctgaaactgtactgtgcggtgctg gcaggcgcaggcaagggagaccccgctggctccaccgcccaggccaccctgaagtccccg ctggaggaggtgcagcggacccgctcccggggagcccgtcccgcgccctccctccgagaa gccccttcctccagctccttcctctgcccgccggcccgagagcaccgaggcggcttctct gtcctgtgtactgcccaggacaagcggcattggaagcgattcacgacgctacactttcag gtcagtaacaactttcagaaagcaaacaaaaccaaaataagagtcagaaaaaccacttcc aatctggagggctcctcgcccgatgctgctgctgctgcgctcttcaaggagaagcgtttg aatcgatcatttcgttttcagagatcccaacagagcactgagtcaaagatccagaccagc tag >gi568815589f:96350280_96590035|GENSCAN_predicted_peptide_4|443_aa MSREGAGAALVAEVIKDRLCFAILYSRPKSASNVHYFSIDNELEYENFYADFGPLNLAMV YRYCCKINKKLKAMQYGFLNFNSFNLDEYEHYENHNVTTIIRLNKRMYDAKRFTDAGFDH HDLFFADGSTPTDAIVKEFLDICENAEGAIAVHCKAGLGRTGTLIACYIMKHYRMTAAET IAWVRICRPGSVIGPQQQFLVMKQTNLWLEGDYFRQKLKGQENGQHRAAFSKLLSGVDDI SINGVENQDQQEPEPYSDDDEINGVTQGDRLRALKSRRQSKTNAIPLTEYNFTISEISEY RGSKDAAEKKVNWNIGKRVILQSSVQSCKTSEPNISGSAGITKRTTRSASRKSSVKSFPA MYIRGFGPRGLISVKLVPSAKQPVSQETLAELGVLCALPFEDSFSSAMSPLHCFVQPYCT LNTRMGMQQCPGIRIGGLSGTDS >gi568815589f:96350280_96590035|GENSCAN_predicted_CDS_4|1332_bp atgagccgggagggcgcgggggcagctttggtagccgaggtgatcaaagatcgcctttgt tttgccattctctacagcagaccaaagagtgcatcaaatgtacattatttcagcatagat aatgaacttgaatatgagaacttctacgcagattttggaccactcaatctggcaatggtt tacagatattgttgcaagatcaataagaaattaaaggcaatgcagtatggcttccttaat ttcaactcatttaaccttgatgaatatgaacactatgaaaatcacaatgttactaccatt attcgtctgaataaaaggatgtatgatgccaaacgctttacggatgctggcttcgatcac catgatcttttctttgcggatggcagcacccctactgatgccattgtcaaagaattccta gatatctgtgaaaatgctgagggtgccattgcagtacattgcaaagctggccttggtcgc acgggcactctgatagcctgctacatcatgaagcattacaggatgacagcagccgagacc attgcgtgggtcaggatctgcagacctggctcggtgattgggcctcagcagcagtttttg gtgatgaagcaaaccaacctctggctggaaggggactattttcgtcagaagttaaagggg caggagaatggacaacacagagcagccttctccaaacttctctctggcgttgatgacatt tccataaatggggtcgagaatcaagatcagcaagaacccgaaccgtacagtgatgatgac gaaatcaatggagtgacacaaggtgatagacttcgggccttgaaaagcagaagacaatcc aaaacaaacgctattcctctcacggaatacaatttcactatatctgaaatctcagagtac cgaggctccaaggatgctgcagagaagaaggtgaactggaatataggcaagagagtaatt cttcaatccagtgttcagagctgtaaaacatctgaacctaacatttctggcagtgcaggc attactaaaagaaccaccagatctgcttcaaggaaaagcagtgttaaaagcttccctgcc atgtatatccgaggctttgggcctaggggccttatcagtgtgaaattagtccccagtgca aagcagccagtctcccaagagaccttggcagagctgggagttctgtgtgctttgcctttt gaagactcattcagctctgccatgtctcctctacactgttttgtacaaccttactgcaca cttaacactcgcatggggatgcagcagtgccccggcataaggattggaggactgtcaggc actgactcatga