GENSCAN 1.0	Date run: 5-Nov-116	Time: 14:09:45

Sequence gi568815578f:43573805_43816084 : 242280 bp : 46.58% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1105   1200   96  0  0   65  105   121 0.996  11.71
 1.02 Intr +   2420   2575  156  1  0  103   28   217 0.905  17.41
 1.03 Intr +   6168   6257   90  2  0   78   83    48 0.836   3.49
 1.04 Intr +  10786  10803   18  0  0  105   70    37 0.503   0.41
 1.05 Term +  11048  11212  165  1  0   60   43   182 0.993   8.82
 1.06 PlyA +  13841  13846    6                                1.05

 2.00 Prom +  20203  20242   40                               -0.76
 2.01 Init +  20895  21013  119  2  2  105   47   153 0.747  12.57
 2.02 Intr +  40046  40172  127  2  1   99   87    10 0.193   2.68
 2.03 Intr +  45136  45222   87  0  0   72  103    49 0.947   4.97
 2.04 Intr +  47053  47121   69  0  0   81   59    44 0.437   0.18
 2.05 Term +  50087  50263  177  1  0   75   52   287 0.814  21.49
 2.06 PlyA +  51331  51336    6                               -1.75

 3.00 Prom +  51404  51443   40                               -5.56
 3.01 Init +  51640  51645    6  0  0  104  101    10 0.106   4.21
 3.02 Intr +  63341  63449  109  1  1  132   84    62 0.985  10.06
 3.03 Intr +  68675  68820  146  0  2   73   92   140 0.871  12.90
 3.04 Term +  73132  73179   48  0  0   92   49    51 0.807  -1.10
 3.05 PlyA +  73828  73833    6                               -0.45

 4.00 Prom +  76308  76347   40                               -5.76
 4.01 Init +  78719  78894  176  1  2   83   52   126 0.991   7.32
 4.02 Intr +  93248  93403  156  2  0   68    1   117 0.445   0.13
 4.03 Intr +  95681  95752   72  2  0   92   97    10 0.599   0.82
 4.04 Intr + 100002 100095   94  0  1  105  105   137 0.993  17.07
 4.05 Intr + 107980 108051   72  0  0   92   76    98 0.921   8.60
 4.06 Intr + 113048 113268  221  1  2   -6   99   423 0.071  31.10
 4.07 Intr + 118353 118515  163  0  1   94   94   235 0.991  24.68
 4.08 Intr + 125953 126240  288  0  0   85   88   355 0.953  32.64
 4.09 Intr + 128686 129099  414  0  0   74   96   436 0.963  37.30
 4.10 Intr + 131415 131554  140  2  2   78   66   167 0.961  12.76
 4.11 Intr + 136159 136258  100  1  1   83   92   159 0.634  15.91
 4.12 Intr + 137684 137797  114  1  0   72   84   227 0.949  21.34
 4.13 Intr + 139198 139302  105  0  0   94   87   157 0.998  16.61
 4.14 Intr + 141330 141479  150  2  0   80   76   170 0.995  15.36
 4.15 Term + 142155 142283  129  2  0   94   51   137 0.893   8.78
 4.16 PlyA + 142659 142664    6                                1.05

 5.06 PlyA - 144549 144544    6                                1.05
 5.05 Term - 148228 148091  138  1  0   69   44   130 0.243   4.66
 5.04 Intr - 152831 152612  220  1  1   38   19   309 0.004  17.30
 5.03 Intr - 162317 162171  147  1  0  110   69    25 0.364   2.15
 5.02 Intr - 163512 163407  106  1  1   47   83    67 0.463   1.37
 5.01 Init - 169248 169197   52  0  1   93   80    17 0.345   2.72
 5.00 Prom - 171194 171155   40                               -3.26

 6.06 PlyA - 171769 171764    6                                1.05
 6.05 Term - 181990 181883  108  1  0  102   43    66 0.217   2.01
 6.04 Intr - 192990 192807  184  2  1   90    8   114 0.193   3.39
 6.03 Intr - 193385 193225  161  2  2   44   72    49 0.190  -2.31
 6.02 Intr - 194096 193949  148  1  1   75   86    67 0.229   5.44
 6.01 Init - 203797 203739   59  1  2   81  111     9 0.304   3.28
 6.00 Prom - 211330 211291   40                               -3.26

 7.00 Prom + 218479 218518   40                               -2.96
 7.01 Init + 241563 242274  712  2  1   88   58   494 0.960  41.47

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  79295  79490  196  2  1   22   54   176 0.995   4.48
S.002 Init +  88828  88835    8  0  2  114   91     0 0.804   3.40
S.003 Init + 112991 113268  278  1  2   62   99   436 0.927  36.56
S.004 Init + 155263 155323   61  0  1   78  106    72 0.826   9.41
S.005 Term + 155591 155712  122  0  2   61   48    85 0.850   0.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_1|174_aa
YLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVSQMYENILHQPLQIPGGRTV
AACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDLYHKRLTPPFNPNVRHEQYL
TGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPEDDDILDC

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_1|525_bp
tacttggcacctgaagtgcttcggaaagagccttatgatcgagcagtggactggtggtgc
ttgggggcagtcctctacgagatgctccatggcctgccgcccttctacagccaagatgta
tcccagatgtatgagaacattctgcaccagccgctacagatccccggaggccggacagtg
gccgcctgtgacctcctgcaaagccttctccacaaggaccagaggcagcggctgggctcc
aaagcagactttcttgagattaagaaccatgtattcttcagccccataaactgggatgac
ctgtaccacaagaggctaactccacccttcaacccaaatgtgcgccacgaacagtacctg
acaggacctgctgacttgaagcattttgacccagagttcacccaggaagctgtgtccaag
tccattggctgtacccctgacactgtggccagcagctctggggcctcaagtgcattcctg
ggattttcttatgcgccagaggatgatgacatcttggattgctag

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_2|192_aa
MEKELRSTILFNAYKKEIFTTNNGYKSMQKKLRSNWKIQRALTFVYPFGATLSVMKPAVA
VLSTGSVCFPLNRPILAFYHSKNQGGKLAVLGSCHMFSDQYLDKEENSKIMDVVFQWLTT
GDIHLNQIDAEDPEISDYMMLPYTATLSKRNRECLQESDEIPRDFTTLFDLSIFQLDTTS
FHSVIESVPVGL

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_2|579_bp
atggagaaagagctgcggagcaccattcttttcaatgcctacaaaaaggagatatttacc
accaacaatggctacaaatccatgcagaaaaaacttcggagtaattggaagattcagagg
gctctcacctttgtgtatccttttggtgccacattgagtgtcatgaaaccagcagtggcg
gttctgtctacaggttctgtctgcttcccacttaacagacccattttggctttctatcac
tcaaagaaccaaggtgggaagctggcagtgcttggttcatgtcacatgttcagtgatcaa
tatttggacaaagaagaaaacagcaaaatcatggatgttgttttccagtggctcacgaca
ggagacatccacctaaaccagattgatgctgaggacccagagatttctgactacatgatg
ctgccctacacagccaccctatcaaagcggaatcgagagtgtctccaggagagtgatgag
atcccaagggactttaccaccctcttcgacctgtccatcttccagctggataccacctcc
ttccacagcgtcatcgagtcagtacctgtgggcctctga

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_3|102_aa
MAVFPPSFRELPPPPLELFDLDETFSSEKARLAQITNKCTEEDLEFYVRKCGDILGVTSK
LPKDQQDAKHILEHVFFQVVEFKKLNQEHDIDTSETAFQNNF

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_3|309_bp
atggcggtttttcctcccagtttccgggagttaccacctcctcctctggagctatttgat
ttagatgaaacgttctcctctgagaaggcacggctggctcagattaccaataagtgtact
gaagaagacctggaattttatgtcaggaagtgtggtgatattcttggagtaaccagtaaa
ctaccaaaggaccaacaggatgccaaacatatccttgagcacgtcttcttccaagtggtg
gagttcaagaaattgaaccaggaacatgacatcgatacaagtgaaacagcattccagaac
aatttctga

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_4|797_aa
MPSRLRKTQKLRGHVSHGHGRIGKLQKHPRGHSNAGGMHHHRINFNKYYPGYFGKVGMRT
GRRRRAWREIEKCFNPRRRRLQFLRARSAGPADTLTPSSAARGPERPEQPGAPLPALSLT
HPCPGAEPSTLKGKSEDLDELHYQDTDSDVPEQRDSKCKVKWTHEEDEQLRALVRQFGQQ
DWKFLASHFPVIELVKKYGTKQWTLIAKHLKGRLGKQCRERWHNHLNPEVKKSCWTEEED
RIICEAHKVLGNRWAEIAKMLPGRTDNAVKNHWNSTIKRKVDTGGFLSESKDCKPPVYLL
LELEDKDGLQSAQPTEGQGSLLTNWPSVPPTIKEEENSEEELAAATTSKEQEPIGTDLDA
VRTPEPLEEFPKREDQEGSPPETSLPYKWVVEAANLLIPAVGSSLSEALDLIESDPDAWC
DLSKFDLPEEPSAEDSINNSLVQLQASHQQQVLPPRQPSALVPSVTEYRLDGHTISDLSR
SSRGELIPISPSTEVGGSGIGTPPSVLKRQRKRRVALSPVTENSTSLSFLDSCNSLTPKS
TPVKTLPFSPSQFLNFWNKQDTLELESPSLTSTPVCSQKVVVTTPLHRDKTPLHQKHAAF
VTPDQKYSMDNTPHTPTPFKNALEKYGPLKPLPQTPHLEEDLKEVLRSEAGIELIIEDDI
RPEKQKRKPGLRRSPIKKVRKSLALDIVDEDVKLMMSTLPKSLSLPTTAPSNSSSLTLSG
IKEDNSLLNQGFLQAKPEKAAVAQKPRSHFTTPAPMSSAWKTVACGGTRDQLFMQEKARQ
LLGRLKPSHTSRTLILS

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_4|2394_bp
atgccatccagactgaggaagacccagaaacttcggggccatgtgagccatggccatggc
cgcataggcaagctccaaaagcaccccagaggccacagtaatgctggtggaatgcatcac
cacaggatcaacttcaacaaatactacccaggttactttgggaaagttggtatgaggact
gggcggcgccgacgcgcttggcgggagatagaaaagtgcttcaacccgcgccggcggcga
ctgcagttcctgcgagcgaggagcgcgggacctgctgacacgctgacgccttcgagcgcg
gcccggggcccggagcggccggagcagcccggtgccccacttcccgctttgtctctgact
cacccctgtcctggagcagagcccagcacgttgaagggcaaaagcgaggatctggatgag
ctgcactaccaggacacagattcagatgtgccggagcagagggatagcaagtgcaaggtc
aaatggacccatgaggaggacgagcagctgagggccctggtgaggcagtttggacagcag
gactggaagttcctggccagccacttccctgtcatcgagctggttaagaagtatggcaca
aagcagtggacactgattgccaagcacctgaagggccggctggggaagcagtgccgtgaa
cgctggcacaaccacctcaaccctgaggtgaagaagtcttgctggaccgaggaggaggac
cgcatcatctgcgaggcccacaaggtgctgggcaaccgctgggccgagatcgccaagatg
ttgccagggaggacagacaatgctgtgaagaatcactggaactctaccatcaaaaggaag
gtggacacaggaggcttcttgagcgagtccaaagactgcaagcccccagtgtacttgctg
ctggagctcgaggacaaggacggcctccagagtgcccagcccacggaaggccagggaagt
cttctgaccaactggccctccgtccctcctaccataaaggaggaggaaaacagtgaggag
gaacttgcagcagccaccacatcgaaggaacaggagcccatcggtacagatctggacgca
gtgcgaacaccagagcccttggaggaattcccgaagcgtgaggaccaggaaggctcccca
ccagaaacgagcctgccttacaagtgggtggtggaggcagctaacctcctcatccctgct
gtgggttctagcctctctgaagccctggacttgatcgagtcggaccctgatgcttggtgt
gacctgagtaaatttgacctccctgaggaaccatctgcagaggacagtatcaacaacagc
ctagtgcagctgcaagcgtcacatcagcagcaagtcctgccaccccgccagccttccgcc
ctggtgcccagtgtgaccgagtaccgcctggatggccacaccatctcagacctgagccgg
agcagccggggcgagctgatccccatctcccccagcactgaagtcgggggctctggcatt
ggcacaccgccctctgtgctcaagcggcagaggaagaggcgtgtggctctgtcccctgtc
actgagaatagcaccagtctgtccttcctggattcctgtaacagcctcacgcccaagagc
acacctgttaagaccctgcccttctcgccctcccagtttctgaacttctggaacaaacag
gacacattggagctggagagcccctcgctgacatccaccccagtgtgcagccagaaggtg
gtggtcaccacaccactgcaccgggacaagacacccctgcaccagaaacatgctgcgttt
gtaaccccagatcagaagtactccatggacaacactccccacacgccaaccccgttcaag
aacgccctggagaagtacggacccctgaagcccctgccacagaccccgcacctggaggag
gacttgaaggaggtgctgcgttctgaggctggcatcgaactcatcatcgaggacgacatc
aggcccgagaagcagaagaggaagcctgggctgcggcggagccccatcaagaaagtccgg
aagtctctggctcttgacattgtggatgaggatgtgaagctgatgatgtccacactgccc
aagtctctatccttgccgacaactgccccttcaaactcttccagcctcaccctgtcaggt
atcaaagaagacaacagcttgctcaaccagggcttcttgcaggccaagcccgagaaggca
gcagtggcccagaagccccgaagccacttcacgacacctgcccctatgtccagtgcctgg
aagacggtggcctgcggggggaccagggaccagcttttcatgcaggagaaagcccggcag
ctcctgggccgcctgaagcccagccacacatctcggaccctcatcttgtcctga

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_5|220_aa
MGQCGWCAERIGRKRMPGWNCQRPYDAGQASPKVELSPQGFLALPRKEFKGQSLTCTSRM
NYLYLKSHLRVYFWKNSSIIPVELHKTEKLKAAGGREANATSRFQYHLASCRRKNPKKAK
KMATCKYNACHVVPIKNLEEHEAVCVNRSAVEEEDTENPLKVSPPSSEQNDDTQQGPLQL
PKSQLPTQASSSTEQAGTLPYWTGLQLPKLQLWIQAFPCS

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_5|663_bp
atgggccagtgtggctggtgtgcagagaggataggaaggaagcgcatgccaggatggaac
tgccaaagaccttatgatgcagggcaggcaagccccaaagtggagctcagtccacaaggg
ttcttggctttgcccaggaaagaattcaagggccagagccttacctgcacttcccgaatg
aactacttatatctgaaatctcatcttagggtctacttctggaagaactctagcataata
ccagtagaactccacaaaactgaaaagctcaaagcagcaggtggcagagaagcaaatgcc
accagcagattccagtaccacctggcatcgtgcaggagaaagaaccccaagaaagccaaa
aagatggccacctgcaaatacaacgcctgccacgtggtccccatcaaaaatctggaggaa
catgaggctgtttgtgtcaacaggagcgctgtggaagaagaggacaccgagaaccctctg
aaagtcagtcctcctagttcagagcagaacgatgacacccagcaggggccgctgcagctg
cccaaatctcagctgccaacccaggcctccagctccacagagcaggcaggaaccctgccc
tactggacagggctgcagctgcccaaactgcagctgtggatccaagccttcccgtgctct
tag

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_6|219_aa
MGPLLTSSIHRHDFFKGVDRFASTAARSSLAAENQKYSPFTSTGSLYLTNAAARNVSTAL
PRHLYPHSQSLIHPRVPECHRCARHCMGRSMVNGTDTVPLEAALFLAVPSRQDMCAQPRL
PSRAGANHSTSPSLNYLTNKMGTIAVAVREGPPIDSTNRDRAATICQALGVAVQSKEGDE
CRLLLLPQSDETLLFVRKGPKINPTAHVGDGTEEEVKKD

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_6|660_bp
atgggccctttattaacttccagcatccacaggcatgacttcttcaagggagttgaccgg
tttgccagcacagctgccagaagctctctcgcagcagagaaccaaaaatacagcccattt
acttcaacaggctccctctacctgacaaatgcagcagccaggaatgtatccactgctctt
ccccggcatctatacccacattcccagtctctcattcacccaagggttcctgagtgtcac
cgctgtgcccggcattgcatggggaggagcatggtgaatgggacagacacggtccctctg
gaggcggctctcttcctggcagttcccagccgccaggacatgtgtgcccagcccagactc
cctagcagggctggggcaaatcactcaacatctccaagcctcaactacctcaccaacaaa
atggggacaatagcagtagctgtcagagaaggaccacctattgactccacaaatagagat
cgggcagctactatttgccaggcacttggggttgctgtgcagagcaaagagggtgatgaa
tgccggctcctgctactcccacaaagtgatgaaactttgctttttgtgagaaagggccct
aaaataaatccaactgctcatgttggtgatgggacagaagaagaagtgaagaaagattaa

>gi568815578f:43573805_43816084|GENSCAN_predicted_peptide_7|238_aa
MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEEYITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDAENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKX

>gi568815578f:43573805_43816084|GENSCAN_predicted_CDS_7|714_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatatataactagaata
accaatactgagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatgcggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagnn