GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:14:43 Sequence gi568815582f:23655050_23857580 : 202531 bp : 45.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3458 3576 119 1 2 96 75 106 0.988 9.36 1.02 Intr + 6121 6232 112 1 1 114 68 56 0.595 6.58 1.03 Intr + 10577 10679 103 1 1 9 91 69 0.832 -0.95 1.04 Term + 11998 12095 98 2 2 77 39 130 0.872 5.13 1.05 PlyA + 12215 12220 6 -0.45 2.03 PlyA - 13400 13395 6 1.05 2.02 Term - 14553 14336 218 1 2 120 48 98 0.315 6.31 2.01 Init - 16885 16804 82 1 1 37 84 39 0.016 -0.47 2.00 Prom - 17332 17293 40 -5.36 3.00 Prom + 18925 18964 40 -4.96 3.01 Init + 23884 24291 408 0 0 76 96 647 0.926 60.35 3.02 Intr + 25035 25203 169 2 1 99 95 253 0.999 26.62 3.03 Intr + 25865 26009 145 0 1 101 103 86 0.997 10.74 3.04 Intr + 27015 27108 94 0 1 91 57 70 0.999 4.17 3.05 Intr + 28821 29057 237 2 0 99 86 278 0.754 26.51 3.06 Intr + 32130 32244 115 2 1 46 53 103 0.869 2.62 3.07 Intr + 32420 32575 156 0 0 102 98 207 0.995 23.08 3.08 Intr + 33619 33696 78 2 0 108 74 109 0.990 11.02 3.09 Intr + 34189 34343 155 2 2 69 113 202 0.973 20.59 3.10 Intr + 34445 34627 183 1 0 55 99 295 0.999 27.28 3.11 Term + 34811 35014 204 1 0 128 38 475 0.999 43.87 3.12 PlyA + 35298 35303 6 -1.95 4.21 PlyA - 35587 35582 6 -0.45 4.20 Term - 35994 35782 213 0 0 76 51 183 0.967 10.63 4.19 Intr - 36147 36080 68 1 2 91 95 63 0.887 5.92 4.18 Intr - 36376 36253 124 1 1 46 80 163 0.452 11.46 4.17 Intr - 37041 36914 128 1 2 105 71 6 0.866 1.00 4.16 Intr - 37282 37135 148 1 1 75 89 161 0.988 14.71 4.15 Intr - 39878 39679 200 0 2 120 76 256 0.999 26.67 4.14 Intr - 40069 39970 100 1 1 43 48 117 0.999 2.88 4.13 Intr - 40340 40151 190 1 1 78 105 274 0.999 27.69 4.12 Intr - 40929 40845 85 1 1 118 117 11 0.998 5.88 4.11 Intr - 45655 45490 166 1 1 52 105 149 0.975 12.53 4.10 Intr - 47224 47103 122 2 2 108 78 37 0.733 4.91 4.09 Intr - 47488 47341 148 1 1 90 77 30 0.733 1.91 4.08 Intr - 47653 47575 79 0 1 108 83 79 0.974 8.95 4.07 Intr - 50098 49834 265 2 1 57 37 307 0.535 19.07 4.06 Intr - 51382 51281 102 2 0 34 82 117 0.964 5.85 4.05 Intr - 51812 51705 108 0 0 92 94 108 0.999 12.06 4.04 Intr - 51980 51958 23 1 2 97 99 23 0.562 1.49 4.03 Intr - 55500 55467 34 1 1 106 89 91 0.864 8.28 4.02 Intr - 55969 55864 106 1 1 132 89 107 0.999 14.99 4.01 Init - 58138 58046 93 1 0 94 94 141 0.999 13.68 4.00 Prom - 86564 86525 40 -5.16 5.00 Prom + 91746 91785 40 -4.16 5.01 Init + 100001 100067 67 1 1 93 68 87 0.993 8.47 5.02 Intr + 100612 100684 73 2 1 104 78 110 0.999 10.06 5.03 Intr + 100798 100878 81 1 0 74 68 88 0.947 4.15 5.04 Intr + 101014 101144 131 1 2 107 74 80 0.666 8.84 5.05 Intr + 101339 101400 62 0 2 103 52 71 0.974 3.45 5.06 Intr + 102152 102274 123 1 0 99 77 182 0.981 18.98 5.07 Term + 102481 102534 54 0 0 86 41 104 0.936 3.06 5.08 PlyA + 102623 102628 6 1.05 6.00 Prom + 111957 111996 40 -3.56 6.01 Init + 114587 114704 118 1 1 31 0 180 0.108 3.86 6.02 Intr + 131849 131949 101 0 2 62 84 57 0.134 2.53 6.03 Intr + 134575 134610 36 0 0 86 113 15 0.161 2.26 6.04 Intr + 152074 152143 70 0 1 114 81 55 0.192 6.15 6.05 Intr + 181068 181299 232 1 1 88 93 299 0.127 27.23 6.06 Intr + 182326 182357 32 1 2 129 102 36 0.978 6.87 6.07 Term + 188263 188285 23 2 2 85 52 18 0.342 -3.63 6.08 PlyA + 188343 188348 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 181127 181299 173 1 2 92 93 292 0.866 29.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:23655050_23857580|GENSCAN_predicted_peptide_1|143_aa TIVMNDCIIRGDLANVRVGRHCVVKSRSVIRPPFKKFSKGVAFFPLHIGDHVFIEEDCVV NAAQIGSYVHVGKNCVIGRRCVLKDCCKILDNTVLPPETVVPPFTVFSGCPGLFSGELPE CTQELMIDVTKSYYQKFLPLTQV >gi568815582f:23655050_23857580|GENSCAN_predicted_CDS_1|432_bp accattgtgatgaatgactgtattatccgaggggatctggcaaatgtaagagttggacgt cattgtgttgtgaaaagtcgtagtgtcataaggccaccattcaagaagttcagcaaaggt gttgcattctttcctttacatattggagaccatgtctttattgaggaagattgtgtggtc aacgcagcacagattggttcctatgttcatgttgggaagaactgtgtgattgggcgccga tgtgtgttgaaagactgctgcaaaattcttgacaacacagtattacctccagaaactgtg gttccaccattcactgtcttctcaggctgcccaggactcttctcaggggagctcccggag tgcactcaggagctgatgattgacgtcaccaagagctactaccagaagtttttgcccctg acgcaagtctag >gi568815582f:23655050_23857580|GENSCAN_predicted_peptide_2|99_aa MSLILDGFHLHDPFAQCPMDVKIQKSPGNIICHHNHQHQLSVSVGTCQACPFGVFHLPGH NDDLRLDTFLVWTNENQIQQFCQQRGALFINNWCCGATL >gi568815582f:23655050_23857580|GENSCAN_predicted_CDS_2|300_bp atgtctcttatcttggatggttttcatctgcatgacccatttgcccaatgcccaatggat gtgaagatacagaaaagtccaggtaatatcatctgccaccacaaccaccagcaccaactc tcagtctctgtgggtacatgccaggcctgtccatttggtgtattccatcttcctggccac aatgatgacttgaggctggataccttcctcgtctggaccaatgagaaccaaatccagcag ttctgtcagcaaaggggagctctttttatcaataactggtgctgtggggccacactgtga >gi568815582f:23655050_23857580|GENSCAN_predicted_peptide_3|647_aa MSAAVTAGKLARAPADPGKAGVPGVAAPGAPAAAPPAKEIPEVLVDPRSRRRYVRGRFLG KGGFAKCFEISDADTKEVFAGKIVPKSLLLKPHQREKMSMEISIHRSLAHQHVVGFHGFF EDNDFVFVVLELCRRRSLLELHKRRKALTEPEARYYLRQIVLGCQYLHRNRVIHRDLKLG NLFLNEDLEVKIGDFGLATKVEYDGERKKTLCGTPNYIAPEVLSKKGHSFEVDVWSIGCI MYTLLVGKPPFETSCLKETYLRIKKNEYSIPKHINPVAASLIQKMLQTDPTARPTINELL NDEFFTSGYIPARLPITCLTIPPRFSIAPSSLDPSNRKPLTVLNKGTTRVWLGSSELLVP SSCLPEWQVILQGHTVGSEATASEDLLPHSLENPLPERPREKEEPVVRETGEVVDCHLSD MLQQLHSVNASKPSERGLVRQEEAEDPACIPIFWVSKWVDYSDKYGLGYQLCDNSVGVLF NDSTRLILYNDGDSLQYIERDGTESYLTVSSHPNSLMKKITLLKYFRNYMSEHLLKAGAN ITPREGDELARLPYLRTWFRTRSAIILHLSNGSVQINFFQDHTKLILCPLMAAVTYIDEK RDFRTYRLSLLEEYGCCKELASRLRYARTMVDKLLSSRSASNRLKAS >gi568815582f:23655050_23857580|GENSCAN_predicted_CDS_3|1944_bp atgagtgctgcagtgactgcagggaagctggcacgggcaccggccgaccctgggaaagcc ggggtccccggagttgcagctcccggagctccggcggcggctccaccggcgaaagagatc ccggaggtcctagtggacccacgcagccggcggcgctatgtgcggggccgctttttgggc aagggcggctttgccaagtgcttcgagatctcggacgcggacaccaaggaggtgttcgcg ggcaagattgtgcctaagtctctgctgctcaagccgcaccagagggagaagatgtccatg gaaatatccattcaccgcagcctcgcccaccagcacgtcgtaggattccacggctttttc gaggacaacgacttcgtgttcgtggtgttggagctctgccgccggaggtctctcctggag ctgcacaagaggaggaaagccctgactgagcctgaggcccgatactacctacggcaaatt gtgcttggctgccagtacctgcaccgaaaccgagttattcatcgagacctcaagctgggc aaccttttcctgaatgaagatctggaggtgaaaataggggattttggactggcaaccaaa gtcgaatatgacggggagaggaagaagaccctgtgtgggactcctaattacatagctccc gaggtgctgagcaagaaagggcacagtttcgaggtggatgtgtggtccattgggtgtatc atgtataccttgttagtgggcaaaccaccttttgagacttcttgcctaaaagagacctac ctccggatcaagaagaatgaatacagtattcccaagcacatcaaccccgtggccgcctcc ctcatccagaagatgcttcagacagatcccactgcccgcccaaccattaacgagctgctt aatgacgagttctttacttctggctatatccctgcccgtctccccatcacctgcctgacc attccaccaaggttttcgattgctcccagcagcctggaccccagcaaccggaagcccctc acagtcctcaataaaggtacaacaagggtctggctgggcagctctgagctgctggtgcct tcaagctgtttgcctgagtggcaggtcatcctgcagggccacaccgttgggtcagaggcc acggcttctgaagatctgttgcctcacagcttggagaaccccctgcctgagcgtccccgg gaaaaagaagaaccagtggttcgagagacaggtgaggtggtcgactgccacctcagtgac atgctgcagcagctgcacagtgtcaatgcctccaagccctcggagcgtgggctggtcagg caagaggaggctgaggatcctgcctgcatccccatcttctgggtcagcaagtgggtggac tattcggacaagtacggccttgggtatcagctctgtgataacagcgtgggggtgctcttc aatgactcaacacgcctcatcctctacaatgatggtgacagcctgcagtacatagagcgt gacggcactgagtcctacctcaccgtgagttcccatcccaactccttgatgaagaagatc accctccttaaatatttccgcaattacatgagcgagcacttgctgaaggcaggtgccaac atcacgccgcgcgaaggtgatgagctcgcccggctgccctacctacggacctggttccgc acccgcagcgccatcatcctgcacctcagcaacggcagcgtgcagatcaacttcttccag gatcacaccaagctcatcttgtgcccactgatggcagccgtgacctacatcgacgagaag cgggacttccgcacataccgcctgagtctcctggaggagtacggctgctgcaaggagctg gccagccggctccgctacgcccgcactatggtggacaagctgctgagctcacgctcggcc agcaaccgtctcaaggcctcctaa >gi568815582f:23655050_23857580|GENSCAN_predicted_peptide_4|833_aa MASAVRGSRPWPRLGLQLQFAALLLGTLSPQVHTLRPENLLLVSTLDGSLHALSKQTGDL KWTLRDDPVIEGPMYVTDSDGVFYTGRKQDAWFVVDPESGETQMTLTTEGPSTPRLYIGR TQYTVTMHDPRAPALRWNTTYRRYSAPPMDGSPGKYMSHLASCGMGLLLTVDPGSGTVLW TQDLGVPVMGVYTWHQDGLRQLPHLTLARDTLHFLALRWGHIRLPASGPRDTATLFSTLD TQLLMTLYVGKDETGFYVSKALVHTGVALVPRGLTLAPADGPTTDEVTLQVSGEREGSPS TAVRYPSGSVALPSQWLLIGHHELPPVLHTTMLRVHPTLGSGTAETRPPENTQAPAFFLE QQPQVVEKQQETPLAPADFAHISQDAQSLHSGASRRSQKRLQSPSKQAQPLDDPEAEQLT VVGKISFNPKDVLGRGAGGTFVFRGQFEGRAVAVKRLLRECFGLVRREVQLLQESDRHPN VLRYFCTERGPQFHYIALELCRASLQEYVENPDLDRGGLEPEVVLQQLMSGLAHLHSLHI VHRDLKPGNILITGPDSQGLGRVVLSDFGLCKKLPAGRCSFSLHSGIPGTEGWMAPELLQ LLPPDSPTSAVDIFSAGCVFYYVLSGGSHPFGDSLYRQANILTGAPCLAHLEEEVHDKVV ARDLVGAMLSPLPQPRPSAPQVLAHPFFWSRAKQLQFFQDVSDWLEKESEQEPLVRALEA GGCAVVRDNWHEHISMPLQTDLRKFRSYKGTSVRDLLRAVRNKKHHYRELPVEVRQALGQ VPDGFVQYFTNRFPRLLLHTHRAMRSCASESLFLPYYPPDSEARRPCPGATGR >gi568815582f:23655050_23857580|GENSCAN_predicted_CDS_4|2502_bp atggcgagtgcggtcagggggtcgaggccgtggccccggctggggctccagctccagttc gcggcgctgctgctcgggacgctgagtccacaggttcatactctcaggccagagaacctc ctgctggtgtccaccttggatggaagtctccacgcactaagcaagcagacaggggacctg aagtggactctgagggatgatcccgtcatcgaaggaccaatgtacgtcacagactctgat ggggtcttctacacaggccggaagcaggatgcctggtttgtggtggaccctgagtcaggg gagacccagatgacactgaccacagagggtccctccaccccccgcctctacattggccga acacagtatacggtcaccatgcatgacccaagagccccagccctgcgctggaacaccacc taccgccgctactcagcgccccccatggatggctcacctgggaaatacatgagccacctg gcgtcctgcgggatgggcctgctgctcactgtggacccaggaagcgggacggtgctgtgg acacaggacctgggcgtgcctgtgatgggcgtctacacctggcaccaggacggcctgcgc cagctgccgcatctcacgctggctcgagacactctgcatttcctcgccctccgctggggc cacatccgactgcctgcctcaggcccccgggacacagccaccctcttctctaccttggac acccagctgctaatgacgctgtatgtggggaaggatgaaactggcttctatgtctctaaa gcactggtccacacaggagtggccctggtgcctcgtggactgaccctggcccccgcagat ggccccaccacagatgaggtgacactccaagtctcaggagagcgagagggctcacccagc actgctgttagatacccctcaggcagtgtggccctcccaagccagtggctgctcattgga caccacgagctacccccagtcctgcacaccaccatgctgagggtccatcccaccctgggg agtggaactgcagagacaagacctccagagaatacccaggccccagccttcttcttggag caacagccgcaggtggtggagaagcagcaggagacccccctggcacctgcagactttgct cacatctcccaggatgcccagtccctgcactcgggggccagccggaggagccagaagagg cttcagagtccctcaaagcaagcccagccactcgacgaccctgaagctgagcaactcacc gtagtggggaagatttccttcaatcccaaggacgtgctgggccgcggggcaggcgggact ttcgttttccggggacagtttgagggacgggcagtggctgtcaagcggctcctccgcgag tgctttggcctggttcggcgggaagttcaactgctgcaggagtctgacaggcaccccaac gtgctccgctacttctgcaccgagcggggaccccagttccactacattgccctggagctc tgccgggcctccttgcaggagtacgtagaaaacccggacctggatcgcgggggtctggag cccgaggtcgtgctgcagcagctgatgtctggcctggcccacctgcactctttacacata gtgcaccgggacctgaagccaggaaatattctcatcaccgggcctgacagccagggcctg ggcagagtggtgctctcagacttcggcctctgcaagaagctgcctgctggccgctgtagc ttcagcctccactccggcatccccggcacggaaggctggatggcgcccgagcttctgcag ctcctgccaccagacagtcctaccagcgctgtggacatcttctctgcaggctgcgtgttc tactacgtgctttctggtggcagccacccctttggagacagtctttatcgccaggcaaac atcctcacaggggctccctgtctggctcacctggaggaagaggtccacgacaaggtggtt gcccgggacctggttggagccatgttgagcccactgccgcagccacgcccctctgccccc caggtgctggcccaccccttcttttggagcagagccaagcaactccagttcttccaggac gtcagtgactggctggagaaggagtccgagcaggagcccctggtgagggcactggaggcg ggaggctgcgcagtggtccgggacaactggcacgagcacatctccatgccgctgcagaca gatctgagaaagttccggtcctataaggggacatcagtgcgagacctgctccgtgctgtg aggaacaagaagcaccactacagggagctcccagttgaggtgcgacaggcactcggccaa gtccctgatggcttcgtccagtacttcacaaaccgcttcccacggctgctcctccacacg caccgagccatgaggagctgcgcctctgagagcctcttcctgccctactacccgccagac tcagaggccaggaggccatgccctggggccacagggaggtga >gi568815582f:23655050_23857580|GENSCAN_predicted_peptide_5|196_aa MGSRSSHAAVIPDGDSIRRETGFSQASLLRLHHRFRALDRNKKGYLSRMDLQQIGALAVN PLGDRIIESFFPDGSQRVDFPGFVRVLAHFRPVEDEDTETQDPKKPEPLNSRRNKLHYAF QLYDLDRDGKISRHEMLQVLRLMVGVQVTEEQLENIADRTVQEADEDGDGAVSFVEFTKS LEKMDVEQKMSIRILK >gi568815582f:23655050_23857580|GENSCAN_predicted_CDS_5|591_bp atggggtcgcgcagctcccacgccgcggtcattcccgacggggacagtattcggcgagag accggcttctcccaagccagcctgctccgcctgcaccaccggttccgggcactggacagg aataagaagggctacctgagccgcatggatctccagcagataggggcgctcgccgtgaac cccctgggagaccgaattatagaaagcttcttccccgatgggagccagcgagtggatttc ccaggctttgtcagggtcttggctcattttcgccctgtagaagatgaggacacagaaacc caagaccccaagaaacctgaacctctcaacagcagaaggaacaaacttcactatgcattt cagctctatgacctggatcgcgatgggaagatctccaggcatgagatgctgcaggttctc cgtctgatggttggggtacaggtgacagaagagcagctggagaacatcgctgaccgcacg gtgcaggaggctgatgaagatggggatggggctgtgtccttcgtggagttcaccaagtcc ttagagaagatggacgttgagcaaaaaatgagcatccggatcctgaagtga >gi568815582f:23655050_23857580|GENSCAN_predicted_peptide_6|203_aa MQRPTGTYKDNVEPESVATSNLDDVSDWQEEPVPMTGTWCGYRNCTFFYPISSPPSLKFK VSKRTVFDDTGKRDHGGINLTGFKKMVKEVSRGTVALGQILANQVEQAARGPAAPGPAPL GLRLPARKMADPAAGPPPSEGEESTVRFARKGALRQKNVHEVKNHKFTARFFKQPTFCSH CTDFIWGFGKQGFQCQEADPRDL >gi568815582f:23655050_23857580|GENSCAN_predicted_CDS_6|612_bp atgcaaagaccaacaggaacctacaaggacaacgtggaacctgagtctgttgccacctcc aatcttgatgatgtgagtgactggcaggaggagccagtgcccatgacaggcacctggtgt ggttatagaaactgcaccttcttctatccaatttcaagcccaccatctcttaaatttaag gtctccaaaagaacagtttttgatgacactgggaagagggatcatggtggtattaacctc acagggtttaagaagatggtaaaggaagtatctagaggaactgttgccttgggccaaatt ctggccaaccaggtggaacaagctgcccgcggtcccgcggccccggggccggcacctctc gggctccggctccccgcgcgcaagatggctgacccggctgcggggccgccgccgagcgag ggcgaggagagcaccgtgcgcttcgcccgcaaaggcgccctccggcagaagaacgtgcat gaggtcaagaaccacaaattcaccgcccgcttcttcaagcagcccaccttctgcagccac tgcaccgacttcatctggggcttcgggaagcagggattccagtgccaagaagcagacccc agagacctgtga