GENSCAN 1.0	Date run: 7-Nov-116	Time: 04:46:11

Sequence gi568815588f:96204498_96438221 : 233724 bp : 43.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 Intr -    119     35   85  2  1   65   63    92 0.123   3.79
 1.10 Intr -   3402   3375   28  2  1  113   98    31 0.051   4.72
 1.09 Intr -   5410   5341   70  2  1   50  114    11 0.093  -1.86
 1.08 Intr -  12237  12156   82  0  1   53  116    92 0.184   7.71
 1.07 Intr -  19492  19329  164  2  2   74   88    20 0.119   0.29
 1.06 Intr -  23069  22913  157  2  1   88   66   305 0.540  27.88
 1.05 Intr -  26337  26297   41  1  2  102   78    74 0.893   5.74
 1.04 Intr -  38287  38238   50  0  2  100  110    49 0.953   6.62
 1.03 Intr -  42552  42487   66  2  0   98  106    32 0.933   4.02
 1.02 Intr -  43383  43280  104  0  2   23   82   116 0.875   3.47
 1.01 Init -  45278  45171  108  2  0   80   72    50 0.806   3.12
 1.00 Prom -  51751  51712   40                              -5.56

 2.00 Prom +  58128  58167   40                              -5.66
 2.01 Init +  62778  62887  110  2  2   51  113    59 0.582   4.39
 2.02 Intr +  68280  68444  165  0  0   85   25   353 0.445  27.78
 2.03 Intr +  71748  71784   37  0  1   84   70    23 0.224  -1.64
 2.04 Intr +  72956  73038   83  1  2   70   91    65 0.111   3.44
 2.05 Intr +  99928 100203  276  1  0   18   91   185 0.025   8.43
 2.06 Intr + 113855 114029  175  2  1   73   81   173 0.998  15.04
 2.07 Intr + 114765 114893  129  2  0   42   49   111 0.918   3.49
 2.08 Intr + 116121 116291  171  2  0   92   94    61 0.899   7.24
 2.09 Intr + 118160 118231   72  1  0   53  106   100 0.995   7.90
 2.10 Intr + 119769 119892  124  2  1   73   99    42 0.991   3.96
 2.11 Intr + 122971 123103  133  2  1   58   84   111 0.925   7.40
 2.12 Intr + 124270 124333   64  1  1   44  100    59 0.734   1.42
 2.13 Intr + 127854 128099  246  2  0  118  114    92 0.921  12.46
 2.14 Intr + 131394 131477   84  2  0   68   87   100 0.856   7.92
 2.15 Term + 138918 139061  144  2  0   41   47    83 0.057  -2.59
 2.16 PlyA + 140225 140230    6                               1.05

 3.08 PlyA - 140748 140743    6                               1.05
 3.07 Term - 141790 141761   30  1  0  130   36    -6 0.127  -3.75
 3.06 Intr - 143848 143792   57  1  0   83   94    17 0.297   0.88
 3.05 Intr - 145329 145210  120  0  0  108   45    74 0.386   5.79
 3.04 Intr - 150793 150758   36  1  0   94   98    29 0.481   2.96
 3.03 Intr - 154769 154707   63  2  0   92   65    48 0.602   1.81
 3.02 Intr - 156055 155895  161  2  2   56   50   121 0.717   4.81
 3.01 Init - 157241 157193   49  2  1   86   58    65 0.833   2.41
 3.00 Prom - 158029 157990   40                              -5.56

 4.21 PlyA - 159545 159540    6                               1.05
 4.20 Term - 163725 163591  135  0  0   94   39   258 0.982  19.52
 4.19 Intr - 165818 165568  251  0  2   91   98   534 0.999  51.86
 4.18 Intr - 169312 169099  214  1  1  128   82   263 0.974  27.99
 4.17 Intr - 172322 172195  128  0  2   90   80    84 0.998   8.20
 4.16 Intr - 174595 174470  126  2  0   37   72   187 0.972  12.65
 4.15 Intr - 180270 180090  181  0  1  102   85   304 0.975  30.94
 4.14 Intr - 181718 181558  161  0  2   62   52   139 0.999   7.41
 4.13 Intr - 182581 182456  126  2  0   96   81   187 0.993  19.45
 4.12 Intr - 184957 184927   31  1  1  134   85   -11 0.669   0.90
 4.11 Intr - 188469 188273  197  1  2   50   87    44 0.135  -0.37
 4.10 Intr - 190885 190690  196  1  1  103   55   313 0.536  28.49
 4.09 Intr - 191523 191378  146  1  2   85  105    82 0.955   9.60
 4.08 Intr - 192805 192689  117  2  0  114   76   203 0.964  22.14
 4.07 Intr - 200837 200735  103  2  1   88  106    10 0.797   2.55
 4.06 Intr - 205977 205862  116  1  2   56  100    66 0.961   4.77
 4.05 Intr - 208819 208695  125  0  2  115   93    83 0.999  11.73
 4.04 Intr - 216564 216459  106  1  1   49   58    99 0.624   2.27
 4.03 Intr - 218230 218052  179  0  2   63  110   101 0.832   9.36
 4.02 Intr - 224251 224134  118  2  1   74  106    27 0.343   2.62
 4.01 Intr - 228465 228310  156  1  0   53   65    88 0.288   2.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:96204498_96438221|GENSCAN_predicted_peptide_1|319_aa MALAMAFLRAEAQTQPQIQGYHPLPSPKPPPLKIPQGFPIGYLVTSYVNEGVAHDQSDWL QKVINQTLKLRQLQKMVHDIKNNEGGIMNKIKKLKVKAPPSVPRRDYASESPADEEEQWS DDFDSDYENPDEHSDSEMYVMPAEENADDSYEPPPVEQETRPVHPALPFARGEYIDNRSS QRHSPPFSKTLPSKPSWPSEKARLTSTLPALTALQKPQVPPKPKGLLEDEADYVVPVEDN DENYIHPTESSSPPPEKGRNSGAWETKSPPPAAPSPLPRAGKKPTTPLKTKNLYLLNATE GQVTDKKLCSHQCFLLPRX >gi568815588f:96204498_96438221|GENSCAN_predicted_CDS_1|957_bp atggctctggctatggcttttctcagagcagaggcccagacccagcctcagattcagggc taccaccccctgccatcgcccaaacccccaccactgaaaatccctcaggggtttcccatt ggttacttggttacatcctatgtaaatgaaggagtggcccacgaccagtctgattggttg cagaaggtgatcaatcagacgctgaagttaaggcagcttcaaaagatggtccatgatatt aaaaacaatgaaggtggaataatgaataaaatcaaaaagctaaaagtcaaagcacctcca agtgttcctcgaagggactacgcttcagagagccctgctgacgaagaggagcagtggtcc gatgactttgacagcgactatgaaaatccagatgagcactcggactcagagatgtacgtg atgcccgccgaggagaacgctgatgacagctacgagccgcctccagtagagcaggaaacc aggccggttcacccagccctgcccttcgccagaggcgagtatatagacaatcgatcaagc cagaggcattccccacccttcagcaagacacttcccagtaagcccagctggccttcagag aaagcaaggctcacctccaccctgccggccctgactgctttgcagaaacctcaagtccca cccaaacccaaaggcctccttgaggatgaggctgattatgtggtccccgtggaagataat gatgaaaactatattcatcccacagaaagcagttcacctccacctgaaaaaggtcgaaac agtggggcctgggaaaccaagtcacctccaccagctgcaccatccccgttgccacgggcc gggaaaaaaccaacgacaccactgaagacaaaaaacctatacctgctgaacgccaccgag ggtcaagtcacagacaagaagctgtgcagtcaccagtgtttcctcctgcccagaann >gi568815588f:96204498_96438221|GENSCAN_predicted_peptide_2|670_aa MLWKKRKEKPYMEEKSHKENLQGVVGEWVSAEVKRERPKIWLKTVGKEEKTEEEKEVEEE EEEEEEEAEEEEEEEAEEEETEEEGEKEQRQWASLGKLPQEVGKSHGPAASQNHHPEVKK QIHKALGNQESRRVAVSLPSGDTTRWASQRQQQPLPMDPPRASHLSPRKKRPRQTGALMA SSPQDIKFQDLVVFILEKKMGTTRRAFLMELARRKGFRVENELSDSVTHIVAENNSGSDV LEWLQAQKVQVSSQPELLDVSWLIECIRAGKPVEMTGKHQLVVRRDYSDSTNPGPPKTPP IAVQKISQYACQRRTTLNNCNQIFTDAFDILAENCEFRENEDSCVTFMRAASVLKSLPFT 
IISMKDTEGIPCLGSKVKGIIEEIIEDGESSEVKAVLNDERYQSFKLFTSVFGVGLKTSE KWFRMGFRTLSKVRSDKSLKFTRMQKAGFLYYEDLVSCVTRAEAEAVSVLVKEAVWAFLP DAFVTMTGGFRSPGSTEDEEQLLQKVMNLWEKKGLLLYYDLVESTFEKLRLPSRKVDALD HFQKCFLIFKLPRQRVDSDQSSWQEGKTWKAIRVDLVLCPYERRAFALLGWTGSRQFERD LRRYATHERKMILDNHALYDKTKLHGLVTCIFQSSLTTTPWAKRMEDENCCLHFTSTELR LWMINNRPKG >gi568815588f:96204498_96438221|GENSCAN_predicted_CDS_2|2013_bp atgttgtggaagaaaaggaaagaaaagccatatatggaagagaagagccataaagaaaac ctgcagggtgtggtaggtgagtgggtgtcagctgaggtgaagagggaaagacccaagatc tggctgaagacagtggggaaagaggagaagacagaagaggagaaggaggtggaggaagag gaggaggaggaggaggaagaagcggaggaggaggaggaagaagaagcggaggaggaggag acagaggaggagggggagaaggaacagcggcagtgggccagtttaggcaaactgccacag gaagttggcaagtcacatggcccagccgctagtcagaaccaccacccagaagtgaagaag cagatacacaaggccttaggaaatcaggagtccaggagggtggcagtctccctcccttct ggagacaccaccagatgggccagccagaggcagcagcagcctcttcccatggatccacca cgagcgtcccacttgagccctcggaagaagagaccccggcagacgggtgccttgatggcc tcctctcctcaagacatcaaatttcaagatttggtcgtcttcattttggagaagaaaatg ggaaccacccgcagagcgttcctcatggagctggcccgcaggaaagggttcagggttgaa aatgagctcagtgattctgtcacccacatcgtagcagagaacaactcgggttcggatgtt ctggagtggcttcaagcacagaaagtacaagtcagctcacaaccagagctcctcgatgtc tcctggctgatcgaatgcataagagcagggaaaccggtggaaatgacaggaaaacaccag cttgttgtgagaagagactattcagatagcaccaacccaggccccccgaagactccacca attgctgtacaaaagatctcccagtatgcgtgtcagagaagaaccactttaaacaactgt aaccagatattcacggatgcctttgatatactggctgaaaactgtgagtttagagaaaat gaagactcctgtgtgacatttatgagagcagcttctgtattgaaatctctgccattcaca atcatcagtatgaaggacacagaaggaattccctgcctggggtccaaggtgaagggtatc atagaggagattattgaagatggagaaagttctgaagttaaagctgtgttaaatgatgaa cgatatcaatccttcaaactctttacttctgtatttggagtggggctgaagacttctgag aagtggttcaggatgggtttcagaactctgagtaaagtaaggtcggacaaaagcctgaaa tttacacgaatgcagaaagcaggatttctgtattatgaagaccttgtcagctgtgtgacc agggcagaagcagaggccgtcagtgtgctggttaaagaggctgtctgggcatttcttccg gatgctttcgtcaccatgacaggagggttccggagcccaggatcaacagaggatgaagag caacttttacagaaagtgatgaacttatgggaaaagaagggattacttttatattatgac 
cttgtggagtcaacatttgaaaagctcaggttgcctagcaggaaggttgatgctttggat cattttcaaaagtgctttctgattttcaaattgcctcgtcaaagagtggacagtgaccag tccagctggcaggaaggaaagacctggaaggccatccgtgtggatttagttctgtgcccc tacgagcgtcgtgcctttgccctgttgggatggactggctcccggcagtttgagagagac ctccggcgctatgccacacatgagcggaagatgattctggataaccatgctttatatgac aagaccaagctacatggcttagtcacatgcatatttcaatccagccttacaaccacccca tgggctaaaaggatggaggatgagaactgctgtctccactttacaagtacagaactgagg ctctggatgatcaacaatcggcccaagggctga >gi568815588f:96204498_96438221|GENSCAN_predicted_peptide_3|171_aa MGFRHVGQAVLELLTSAVLALCPVYLLMHTGGFKTIHRPATPKRMSSAPDPLYPVAYLAP TLGQPLGYSGGCRLQKPPSSQASCGFFAEMESFSLNFTLPANTDCGPSLGLAAGIPLLVA TALLVALLFTLIHRRRSSIEAMEESDRPCEISEIDDNPKISEDHCPSLPVV >gi568815588f:96204498_96438221|GENSCAN_predicted_CDS_3|516_bp atggggtttcgccatgttggccaggctgttctcgaactcctgacctcagctgtcctggcc ctctgccctgtctacctgctcatgcacaccgggggctttaaaaccatccacaggccggcc actcccaaacggatgtcttcagccccagatcccctttatccagtcgcctacttggcaccc acacttggacagcccctaggctactcaggtggctgcaggctgcagaagcccccaagctct caggccagctgtggcttctttgctgagatggagagtttttcactgaacttcaccctgccg gcgaacacagactgtgggccctctcttggattagcggcgggcataccattgctggtggcc acagccctgctggtggctttactatttactttgattcaccgaagaagaagcagcattgag gccatggaggaaagtgacagaccatgtgaaatttcagaaattgatgacaatcccaagata tctgaggaccattgtccctccttgcctgttgtctag >gi568815588f:96204498_96438221|GENSCAN_predicted_peptide_4|970_aa XGRENTTLLHSPGTLHAAAKTFSPRVRRATTSRTERIWPGGVIPYVIGGNFTGSQRAIFK QAMRHWEKHTCVTFIERTDEESFIVFSYRTCGCCSYVGRRGGGPQAISIGKNCDKFGIVA HELGHVVGFWHEHTRPDRDQHVTIIRENIQPGQEYNFLKMEAGEVSSLGETYDFDSIMHY ARNTFSRGVFLDTILPRQDDNGVRPTIGQRVRLSQGDIAQARKLYKCPACGETLQDTTGN FSAPGFPNGYPSYSHCVWRISVTPGEKIVLNFTSMDLFKSRLCWYDYVEVRDGYWRKAPL LGRFCGDKIPEPLVSTDSRLWVEFRSSSNILGKGFFAAYEATCGGDMNKDAGQIQSPNYP DDYRPSKECVWRITVSEGFHVGLTFQAFEIERHDSCAYDYLEVRDGPTEESALIGHFCGY EKPEDVKSSSNRLWMKFVSDGSINKAGFAANFFKAHLLLLTLTSFHLFALHGPGVDIEPT QIIQENLLISKILHLTTPAESFFHLIKILNLTTLAESYIKACTAPLPPGPEVDECSWPDH GGCEHRCVNTLGSYKCACDPGYELAADKKMCEVACGGFITKLNGTITSPGWPKEYPTNKN 
CVWQVVAPAQYRISLQFEVFELEGNDVCKYDFVEVRSGLSPDAKLHGRFCGSETPEVITS QSNNMRVEFKSDNTVSKRGFRAHFFSDKDECAKDNGGCQHECVNTFGSYLCRCRNGYWLH ENGHDCKEAGCAHKISSVEGTLASPNWPDKYPSRRECTWNISSTAGHRVKLTFNEFEIEQ HQECAYDHLEMYDGPDSLAPILGRFCGSKKPDPTVASGSSMFLRFYSDASVQRKGFQAVH STECGGRLKAEVQTKELYSHAQFGDNNYPSEARCDWVIVAEDGYGVELTFRTFEVEEEAD CGYDYMEAYDGYDSSAPRLGRFCGSGPLEEIYSAGDSLMIRFRTDDTINKKGFHARYTST KFQDALHMKK >gi568815588f:96204498_96438221|GENSCAN_predicted_CDS_4|2913_bp natggccgggagaataccacactcctgcacagccctgggaccttgcatgccgcagccaag accttctctccccgggtccgaagagccacaacctcaaggacagagaggatatggcctgga ggagtcatcccctacgtcattggagggaacttcactgggagccagagggccatttttaag caggccatgagacactgggagaagcacacctgtgtgaccttcatagaaaggacggatgag gaaagctttattgtattcagttacagaacctgtggctgttgctcctatgttgggcgccga ggaggaggcccacaggccatatccattgggaagaactgtgacaagtttggcattgtggct cacgagctgggccatgtggttgggttttggcatgaacacacccggccagacagagaccaa catgtcaccatcatcagggaaaacatccagccaggtcaggagtataatttcttaaaaatg gaagctggggaagtgagctctctgggagagacatacgactttgacagcatcatgcactac gcccggaacaccttctcaagaggagttttcttagacaccatccttccccgtcaagatgac aatggcgtcaggccaaccattggccagcgcgtgcggctcagtcagggagacatagctcaa gcccggaagctgtacaaatgcccagcgtgtggggagaccctgcaggacacaacgggaaac ttttctgcacctggtttcccaaatgggtacccatcttactcccactgcgtctggaggatc tcggtcaccccaggggaaaagatcgtattaaacttcacatccatggatttgtttaaaagc cgactgtgctggtatgattacgtggaggtccgggatggttactggagaaaagcccccctt ttgggcaggttttgtggcgataagatcccggagcccctcgtctccacggacagccggctc tgggtggagttccgcagcagcagcaacatcttgggcaagggcttctttgcagcgtacgaa gctacctgcgggggagacatgaacaaagatgccggtcagattcaatctcccaactatccg gatgactacagaccttccaaggaatgtgtctggaggattacggtttcagaggggtttcac gtgggacttaccttccaagcttttgagattgaaaggcacgacagctgtgcatatgactac ctggaagtccgggatggccccacggaagagagtgccctgatcggccacttttgtggctat gagaagccggaggatgtgaaatcgagctccaacagactgtggatgaagtttgtgtccgat ggctctatcaataaagcgggctttgcagccaattttttcaaggcacatctcctccttctg accctaacttccttccacctctttgccttacacggacctggtgttgacattgagcccacc cagataatccaggagaatctcctcatctccaagatccttcatttaaccacacctgcagag 
tccttctttcacctaattaagatccttaatttaaccacacttgcagaatcttacattaag gcctgcactgctccactcccacctggtccagaggtggatgagtgttcctggccagatcac ggcgggtgcgagcatcgctgtgtgaacacgctgggcagctacaagtgtgcctgtgaccct ggctacgagctggccgccgataagaagatgtgtgaagtggcctgtggcggtttcattacc aagctgaatggaaccatcaccagccctgggtggccgaaggagtatcccacaaacaaaaac tgtgtctggcaggtggtggcccccgctcagtaccggatctcccttcagtttgaagtgttt gaactggaaggcaatgacgtctgtaagtacgactttgtagaggtgcgcagcggcctgtcc cccgacgccaagctgcacggcaggttctgcggctctgagacgccggaggtcatcacctcg cagagcaacaacatgcgcgtggagttcaagtccgacaacaccgtctccaagcgcggcttc agggcccacttcttctcagataaggacgagtgtgccaaggacaacggcgggtgtcagcat gagtgcgtcaacaccttcgggagctacctgtgcaggtgcagaaacggctactggctccac gagaatgggcatgactgcaaagaggctggctgtgcacacaagatcagcagtgtggagggg accctggcgagccccaactggcctgacaaataccccagccggagggagtgtacctggaac atctcttcgactgcaggccacagagtgaaactcacctttaatgagtttgagatcgagcag caccaggaatgtgcctatgaccacctggaaatgtatgacgggccggacagcctggccccc attctgggccgtttctgcggcagcaagaaaccagaccccacggtggcttccggcagcagt atgtttctcaggttttattcggatgcctcagtgcagaggaaaggcttccaggcagtgcac agcacagagtgcgggggcaggctgaaggctgaagtgcagaccaaagagctctattcccac gcccagtttggggacaacaactacccgagcgaggcccgctgtgactgggtgatcgtggca gaggacggctacggcgtggagctgacattccggacctttgaggttgaggaggaggccgac tgcggctacgactacatggaagcctacgacggctacgacagctcagcgcccaggctcggc cgcttctgtggctctgggccattagaagaaatctactctgcaggtgattccctgatgatt cgattccgcacagatgacaccatcaacaagaaaggctttcatgcccgatacaccagcacc aagttccaggatgccctgcacatgaagaaatag