GENSCAN 1.0	Date run: 6-Nov-116	Time: 18:08:12

Sequence gi568815596f:201934648_202136369 : 201722 bp : 45.15% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7048   7065   18  0  0   88  110     9 0.127   3.36
 1.02 Intr +  10386  10496  111  2  0  100   92    14 0.307   3.58
 1.03 Intr +  15041  15154  114  1  0  126   93     1 0.329   5.04
 1.04 Term +  21896  22120  225  1  0   60   44   118 0.042   1.38
 1.05 PlyA +  23241  23246    6                               1.05

 2.00 Prom +  24591  24630   40                              -5.06
 2.01 Init +  30609  30657   49  2  1   70   81    86 0.518   7.21
 2.02 Intr +  50950  51043   94  2  1  119   50    48 0.259   3.12
 2.03 Intr +  51698  51850  153  2  0   93   78    69 0.170   5.59
 2.04 Term +  64719  64887  169  0  1   66   47    76 0.061  -1.35
 2.05 PlyA +  65614  65619    6                               1.05

 3.00 Prom +  87426  87465   40                              -2.46
 3.01 Sngl + 100001 101725 1725  1  0   78   52  3442 0.976 332.52
 3.02 PlyA + 102828 102833    6                               1.05

 4.03 PlyA - 103075 103070    6                               1.05
 4.02 Term - 108121 108045   77  2  2  110   43    47 0.350   0.40
 4.01 Init - 117366 117303   64  0  1   45   81   105 0.231   4.83
 4.00 Prom - 122376 122337   40                              -3.86

 5.00 Prom + 134561 134600   40                              -4.36
 5.01 Init + 138981 139064   84  2  0   73  105    75 0.938   8.53
 5.02 Intr + 140244 140528  285  2  0   88   96   142 0.997  12.54
 5.03 Intr + 156123 156282  160  2  1  105   82    83 0.905   8.96
 5.04 Intr + 158383 158538  156  2  0   88   73    36 0.593   2.08
 5.05 Intr + 162788 162930  143  0  2   97   66   105 0.998   9.27
 5.06 Intr + 168299 168467  169  1  1   79   92   150 0.994  14.02
 5.07 Intr + 171114 171263  150  1  0  113   65    43 0.935   4.63
 5.08 Intr + 174966 175142  177  1  0   78   84   101 0.580   8.59
 5.09 Intr + 178689 178799  111  1  0   50  121    29 0.208   2.75
 5.10 Intr + 190543 190635   93  2  0   87  121    -2 0.297   2.94
 5.11 Intr + 201530 201655  126  0  0   29  101   107 0.061   6.75

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  72893  72946   54  1  0   67   98    31 0.842   3.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:201934648_202136369|GENSCAN_predicted_peptide_1|155_aa
MTKMPQASSLMLTEAKGNAGPKRHTLPKLTFVERRKEKPDLEEDSRFVLMAKNYLCFNLG
AWAGSITIIISYLKRPGGKGPGKQQHVYAIWSCQSSPCHLGTPPSPGLCWLLDCISRLSP
VIILGDLNLHVNDLSSSLTTLFCALLHTDDPCPIG

>gi568815596f:201934648_202136369|GENSCAN_predicted_CDS_1|468_bp
atgacgaagatgccacaggcttcaagtctcatgctgacagaagccaaagggaatgctggt
ccaaagaggcacacccttcccaagctgacttttgtagagagaaggaaagaaaaacctgac
cttgaggaggactccagattcgtgttaatggcgaagaattatctctgtttcaatcttggg
gcctgggctggctccataacaattataatctcctatctgaagcggcctggaggcaaaggg
ccggggaagcagcagcatgtctatgccatttggtcatgccagtcaagtccctgccatctg
ggcacaccgccttctccaggtctctgctggcttctggattgcatttcccgtctttcccct
gtcatcatccttggtgacctcaaccttcatgtcaatgacctgtccagctccctgaccacc
ctcttctgtgcccttctccacacggatgacccctgtcccataggatag

>gi568815596f:201934648_202136369|GENSCAN_predicted_peptide_2|154_aa
MPQYDLRLLSSSNSKTEYEQHIFRKAHADKCDYLPSCCSVPVGRVLLRQPLFPLLTNGRK
NLLPAGIQGLKGTVDLKKHFWGYEDSWSLRSGADIDDLKLGGFSGMQLTACEHARTQCLA
GLQDCKALALPSHCHRIAGNTLGSPAAVLLAAES

>gi568815596f:201934648_202136369|GENSCAN_predicted_CDS_2|465_bp
atgcctcagtatgacttgcgcctgctcagcagctccaacagcaagactgaatatgagcag
cacatattccggaaagcccacgctgacaagtgcgattaccttccaagctgttgtagcgta
ccagtggggagagtcctgttgaggcaacccctatttcctttattaactaatggcaggaaa
aatttattaccagctggaatccaaggacttaaaggcactgttgacctgaaaaagcatttc
tggggatatgaggacagctggagtctgagaagtggcgccgacatagatgaccttaagctg
ggtgggttctcgggaatgcagctgactgcatgtgaacacgcacgtacccagtgtttggca
gggctgcaggattgcaaggcactggctcttccatcacactgtcatcgcatagcaggaaac
accctgggcagcccagctgcagtgctgctggcagcagagagctga

>gi568815596f:201934648_202136369|GENSCAN_predicted_peptide_3|574_aa
MRDPGAAAPLSSLGLCALVLALLGALSAGAGAQPYHGEKGISVPDHGFCQPISIPLCTDI
AYNQTILPNLLGHTNQEDAGLEVHQFYPLVKVQCSPELRFFLCSMYAPVCTVLDQAIPPC
RSLCERARQGCEALMNKFGFQWPERLRCENFPVHGAGEICVGQNTSDGSGGPGGGPTAYP
TAPYLPDLPFTALPPGASDGRGRPAFPFSCPRQLKVPPYLGYRFLGERDCGAPCEPGRAN
GLMYFKEEERRFARLWVGVWSVLCCASTLFTVLTYLVDMRRFSYPERPIIFLSGCYFMVA
VAHVAGFLLEDRAVCVERFSDDGYRTVAQGTKKEGCTILFMVLYFFGMASSIWWVILSLT
WFLAAGMKWGHEAIEANSQYFHLAAWAVPAVKTITILAMGQVDGDLLSGVCYVGLSSVDA
LRGFVLAPLFVYLFIGTSFLLAGFVSLFRIRTIMKHDGTKTEKLEKLMVRIGVFSVLYTV
PATIVLACYFYEQAFREHWERTWLLQTCKSYAVPCPPGHFPPMSPDFTVFMIKYLMTMIV
GITTGFWIWSGKTLQSWRRFYHRLSHSSKGETAV

>gi568815596f:201934648_202136369|GENSCAN_predicted_CDS_3|1725_bp
atgcgggaccccggcgcggccgctccgctttcgtccctgggcctctgtgccctggtgctg
gcgctgctgggcgcactgtccgcgggcgccggggcgcagccgtaccacggagagaagggc
atctccgtgccggaccacggcttctgccagcccatctccatcccgctgtgcacggacatc
gcctacaaccagaccatcctgcccaacctgctgggccacacgaaccaagaggacgcgggc
ctcgaggtgcaccagttctacccgctggtgaaggtgcagtgttctcccgaactccgcttt
ttcttatgctccatgtatgcgcccgtgtgcaccgtgctcgatcaggccatcccgccgtgt
cgttctctgtgcgagcgcgcccgccagggctgcgaggcgctcatgaacaagttcggcttc
cagtggcccgagcggctgcgctgcgagaacttcccggtgcacggtgcgggcgagatctgc
gtgggccagaacacgtcggacggctccgggggcccaggcggcggccccactgcctaccct
accgcgccctacctgccggacctgcccttcaccgcgctgcccccgggggcctcagatggc
agggggcgtcccgccttccccttctcatgcccccgtcagctcaaggtgcccccgtacctg
ggctaccgcttcctgggtgagcgcgattgtggcgccccgtgcgaaccgggccgtgccaac
ggcctgatgtactttaaggaggaggagaggcgcttcgcccgcctctgggtgggcgtgtgg
tccgtgctgtgctgcgcctcgacgctctttaccgttctcacctacctggtggacatgcgg
cgcttcagctacccagagcggcccatcatcttcctgtcgggctgctacttcatggtggcc
gtggcgcacgtggccggcttccttctagaggaccgcgccgtgtgcgtggagcgcttctcg
gacgatggctaccgcacggtggcgcagggcaccaagaaggagggctgcaccatcctcttc
atggtgctctacttcttcggcatggccagctccatctggtgggtcattctgtctctcact
tggttcctggcggccggcatgaagtggggccacgaggccatcgaggccaactcgcagtac
ttccacctggccgcgtgggccgtgcccgccgtcaagaccatcactatcctggccatgggc
caggtagacggggacctgctgagcggggtgtgctacgttggcctctccagtgtggacgcg
ctgcggggcttcgtgctggcgcctctgttcgtctacctcttcataggcacgtccttcttg
ctggccggcttcgtgtccctcttccgtatccgcaccatcatgaaacacgacggcaccaag
accgagaagctggagaagctcatggtgcgcatcggcgtcttcagcgtgctctacacagtg
cccgccaccatcgtcctggcctgctacttctacgagcaggccttccgcgagcactgggag
cgcacctggctcctgcagacgtgcaagagctatgccgtgccctgcccgcccggccacttc
ccgcccatgagccccgacttcaccgtcttcatgatcaagtacctgatgaccatgatcgtc
ggcatcaccactggcttctggatctggtcgggcaagaccctgcagtcgtggcgccgcttc
taccacagacttagccacagcagcaagggggagactgcggtatga

>gi568815596f:201934648_202136369|GENSCAN_predicted_peptide_4|46_aa
MGQPSAGLLTQPLALLWRLLPGSFRPYHIFKTYGCGGKMTKFRVTG

>gi568815596f:201934648_202136369|GENSCAN_predicted_CDS_4|141_bp
atgggccagccaagtgccgggctccttacccagcccctggcactgctgtggaggctgctg
cctggttcctttagaccataccacatcttcaaaacctacggctgtgggggaaagatgact
aaattcagagtcactgggtga

>gi568815596f:201934648_202136369|GENSCAN_predicted_peptide_5|552_aa
MFTLSLLSRGHGKLGQDKQKLEVYFEPEDYLNWRSPEDYVPVSKPQDKNNASQHSWSLFL
PKTFSTRKGALILYSEGFAISAWTPKERRKGPYCPRGPWRKLDLELHTLQDLKEAILAYG
RQQGEQDRAWQPYLHFRSQLESQAQRQIQPGHSAKRYLRGLLRTWPPDAMYRLWCAGYIK
DSVLLQDSQLNVPKKLRPQQDLSGVPPKYHLLPVFPSFWIQQGKSFEQRQQGLDEGEAGA
AGHVDQGPLAKNHGSQGTRLPPRRKQPWQEDETQAEAPKALKLPPISEEPPRVLEPLKSQ
FKANEPPTELFILPVEIHYHTKQPPKEKAHRRGAPHPESEPESSEESTPVWRPPLKHASL
ETPWELTVHLPVDASRDTLSPQDDDAPPHDVAPPLDLLPPIKGKKSPESQKGVDSPRTSD
HNSPPSLPNMRVPRRALPAAQEDSSDPTLGHFLLGPDGEKVCLSLPGHTQTEALPSGKGK
ISYCFSAYESVNSNISHEEEGPSSQHFLKGVKLQTFMVSVTGLKGSTSGVVRSFRWVRGL
TGLKSEAADLRX

>gi568815596f:201934648_202136369|GENSCAN_predicted_CDS_5|1656_bp
atgttcacgctctccctcctgagccggggccacgggaagctgggccaggacaaacagaag
ttagaagtctactttgaaccagaggattacttgaactggaggtccccagaagactatgtt
cctgtcagcaaacctcaagataagaacaatgccagtcagcactcctggagcctctttctc
cctaaaactttcagtactagaaagggtgccctgatcctgtactcagaaggttttgccatt
tcggcatggacacccaaagagaggagaaaaggcccctactgccccagaggtccctggagg
aagctggatcttgaactgcacacacttcaagacctcaaagaagccatcctggcatatgga
aggcagcagggagagcaggacagagcctggcaaccatacctccacttccgaagccagctg
gagagccaggcccaacggcagatccagccagggcattcagccaagagatacctccgaggc
ctcctgcggacttggcccccagacgccatgtataggctctggtgcgcaggatacatcaag
gattctgtgctactccaggacagtcaacttaatgtaccaaagaagctcaggccacaacaa
gatctttcaggtgtacccccaaaataccatctcctgcctgtcttcccttcattttggatt
caacaaggaaaatcttttgaacaacgtcaacagggcctggatgaaggggaagctggagct
gctggacacgtggaccagggccctctagccaagaaccatggcagtcaggggactcgcttg
ccaccacgcaggaagcagccctggcaggaagatgaaacgcaggcagaggcaccaaaggct
ttgaaattacctcctatctcagaagaaccacccagagtcttggagcccctgaagagccaa
tttaaagccaatgagcccccaacagagctcttcatcttaccggtggagattcattaccac
accaaacaacccccaaaagagaaagcccacagaagaggtgctccacaccctgagtcagaa
ccagaaagcagcgaagaatccacacctgtgtggagacctcccctgaagcatgcgtcctta
gaaacaccatgggagttaacagtgcatctcccagtggacgcgagcagggacacactctca
cctcaagatgatgatgccccacctcacgatgtggccccaccattggatcttctacccccg
attaaaggaaaaaaaagtcctgagagccagaagggcgtggacagccctaggacatcagac
cacaacagccccccaagtctcccgaacatgagagtgcccaggagggcactgccagcagct
caagaagattccagcgaccctacactgggacacttcttgctgggtccagatggagaaaaa
gtctgcctgtccctcccagggcatacccaaaccgaggcgcttccatcgggtaaaggtaaa
atatcttactgtttttcagcctatgaatctgtcaattcaaatatcagccatgaagaggaa
gggcctagtagtcagcatttcctaaaaggagtgaagctgcagaccttcatggtgagtgtt
acaggtcttaaaggcagcacgtctggagttgttcgttctttccggtgggttcgtggtctt
actggcctcaagagtgaagctgcagaccttcgcgnn