GENSCAN 1.0 Date run: 8-Nov-116 Time: 17:14:25 Sequence gi568815579r:42483249_42694828 : 211580 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 55 50 6 1.05 1.10 Term - 2052 1825 228 0 0 27 32 172 0.486 2.13 1.09 Intr - 5894 5838 57 2 0 106 87 30 0.036 3.78 1.08 Intr - 27672 27641 32 1 2 100 121 30 0.004 5.35 1.07 Intr - 28380 28328 53 2 2 35 98 16 0.003 -4.15 1.06 Intr - 29231 29102 130 0 1 104 80 79 0.005 8.45 1.05 Intr - 35987 35700 288 0 0 86 93 281 0.301 25.62 1.04 Intr - 38240 38019 222 0 0 97 119 142 0.560 16.30 1.03 Intr - 38954 38676 279 0 0 61 76 280 0.998 21.55 1.02 Intr - 44152 43793 360 2 0 110 84 306 0.527 27.59 1.01 Init - 45126 45063 64 0 1 107 94 20 0.742 3.95 1.00 Prom - 61250 61211 40 -4.16 2.11 PlyA - 61332 61327 6 1.05 2.10 Term - 63571 63306 266 2 2 124 40 80 0.216 2.47 2.09 Intr - 65408 65311 98 1 2 103 24 53 0.204 0.05 2.08 Intr - 70627 70586 42 0 0 125 105 38 0.335 6.56 2.07 Intr - 72675 72558 118 1 1 104 74 45 0.180 4.22 2.06 Intr - 89195 89112 84 0 0 112 0 73 0.011 0.69 2.05 Intr - 105790 105536 255 2 0 110 119 84 0.772 11.12 2.04 Intr - 106487 106209 279 0 0 72 65 244 0.941 17.95 2.03 Intr - 107127 107083 45 1 0 116 108 1 0.902 3.28 2.02 Intr - 110652 110293 360 1 0 72 78 358 0.859 28.39 2.01 Init - 111580 111517 64 1 1 98 94 42 0.872 5.16 2.00 Prom - 115419 115380 40 -3.46 3.08 PlyA - 115661 115656 6 1.05 3.07 Term - 118731 118538 194 1 2 75 46 91 0.211 1.18 3.06 Intr - 142803 142549 255 1 0 107 95 205 0.657 20.52 3.05 Intr - 143199 143052 148 0 1 49 69 -17 0.169 -7.69 3.04 Intr - 143482 143247 236 2 2 58 75 218 0.666 14.91 3.03 Intr - 148626 148490 137 2 2 1 84 162 0.862 7.31 3.02 Intr - 148849 148714 136 2 1 124 47 51 0.853 4.23 3.01 Init - 152538 152532 7 0 1 61 113 0 0.538 0.89 3.00 Prom - 152965 152926 40 -2.76 4.00 Prom + 162518 162557 40 -1.66 4.01 Init + 166620 166625 6 2 0 72 76 0 0.281 -1.93 4.02 Intr + 167839 167933 95 0 2 81 65 90 0.554 4.76 4.03 Intr + 168039 168167 129 0 0 44 94 60 0.589 2.01 4.04 Intr + 172116 172252 137 0 2 21 78 96 0.490 2.11 4.05 Intr + 178397 178807 411 0 0 34 62 275 0.487 13.66 4.06 Intr + 178975 179127 153 2 0 53 89 53 0.404 1.94 4.07 Intr + 182668 182684 17 2 2 97 98 12 0.445 -1.64 4.08 Term + 187122 187235 114 2 0 26 53 149 0.834 3.77 4.09 PlyA + 187594 187599 6 1.05 5.02 PlyA - 188026 188021 6 1.05 5.01 Sngl - 197287 196775 513 1 0 70 37 183 0.966 7.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 29053 29198 146 0 2 82 94 159 0.969 15.49 S.002 Term + 30414 30438 25 0 1 144 51 32 0.990 2.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:42483249_42694828|GENSCAN_predicted_peptide_1|570_aa MGHLSAPLHRVRVPWQGLLLTASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQ QLFGYSWYKGERVDGNRQIVGYAIGTQQATPGPANSGRETIYPNASLLIQNVTQNDTGFY TLQVIKSDLVNEEATGQFHVYPELPKPSISSNNSNPVEDKDAVAFTCEPETQDTTYLWWI NNQSLPVSPRLQLSNGNRTLTLLSVTRNDTGPYECEIQNPVSANRSDPVTLNVTYTYYRP GANLSLSCYAASNPPAQYSWLINGTFQQSTQELFIPNITVNNSGSYTCHANNSVTGCNRT TVKTIIVTELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTGISIRWFFKNQSLPSSER MKLSQGNTTLSINPVKREDAGTYWCEVFNPISKNQSDPIMLNVNYNALPQENGLSPGAIA GIVIGVVALVALIAVALACFLHFGKTGRASDQRDLTEHKPSVSNHTQDHSNDPPNKDTIG DNGSFMKALTGMACQSKEKAPELPKPKKNRCFMCRKKVGLTGLTADVEICFVDFTVNSDK HNCPYDYKAEAAAKIRKENPVVVAEKIQRI >gi568815579r:42483249_42694828|GENSCAN_predicted_CDS_1|1713_bp atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc acagcctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatcc atgccattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccag caactttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgta ggatatgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagaca atataccccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctac accctacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgta tacccggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaag gatgctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggata aacaatcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctc actctactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaaccca gtgagtgcgaaccgcagtgacccagtcaccttgaatgtcacctacacctattaccgtcca ggggcaaacctcagcctctcctgctatgcagcctctaacccacctgcacagtactcctgg cttatcaatggaacattccagcaaagcacacaagagctctttatccctaacatcactgtg aataatagtggatcctatacctgccacgccaataactcagtcactggctgcaacaggacc acagtcaagacgatcatagtcactgagctaagtccagtagtagcaaagccccaaatcaaa gccagcaagaccacagtcacaggagataaggactctgtgaacctgacctgctccacaaat gacactggaatctccatccgttggttcttcaaaaaccagagtctcccgtcctcggagagg atgaagctgtcccagggcaacaccaccctcagcataaaccctgtcaagagggaggatgct gggacgtattggtgtgaggtcttcaacccaatcagtaagaaccaaagcgaccccatcatg ctgaacgtaaactataatgctctaccacaagaaaatggcctctcacctggggccattgct ggcattgtgattggagtagtggccctggttgctctgatagcagtagccctggcatgtttt ctgcatttcgggaagaccggcagggcaagcgaccagcgtgatctcacagagcacaaaccc tcagtctccaaccacactcaggaccactccaatgacccacctaacaaggacaccattgga gacaatggttcttttatgaaggctttgactggaatggcatgccagagtaaagaaaaagct cctgaattgcccaaaccaaagaaaaacagatgtttcatgtgcagaaagaaagttggtctt acaggtttgactgccgatgtggaaatttgttttgtggacttcaccgttaactctgacaag cacaactgtccgtatgattacaaagcagaagctgcagcaaaaatcagaaaagagaatcca gttgttgtggctgaaaaaattcagagaatataa >gi568815579r:42483249_42694828|GENSCAN_predicted_peptide_2|536_aa MGPISAPSCRWRIPWQGLLLTASLFTFWNPPTTAQLTIEAVPSNAAEGKEVLLLVHNLPQ DPRGYNWYKGETVDANRRIIGYVISNQQITPGPAYSNRETIYPNASLLMRNVTRNDTGSY TLQVIKLNLMSEEVTGQFSVHHHSLDSALSLEVTDSTETPKPSISSNNSNPVEDKDAVAF TCEPETQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLLSVTRNDVGPYECEIQNPASANF SDPVTLNVLYGPDAPTISPSDTYYHAGVNLNLSCHAASNPPSQYSWSVNGTFQQYTQKLF IPNITTKNSGSYACHTTNSATGRNRTTVRMITVSGTLVELEDIMLSKLTQKQKNKYCMFS LIKKNAPGLPVGAFTGIVTRVLVGVAPVATLACFLLLVRTGRPRYPTPGQPLPSMRHHSR QHMDGKRLREKLPVLVRSEAAIKNCWRLGAHISLSTCLTAKAISHLLPSTLPNPVHGGDG PSWNTHHSLSAFSGNHRFYRGHLWNQNSWKDEQRELLRTSWGSWHLVHHMDIINST >gi568815579r:42483249_42694828|GENSCAN_predicted_CDS_2|1611_bp atggggcccatctcagccccttcctgcagatggcgcatcccctggcaggggctcctgctc acagcctcacttttcaccttctggaacccgcccaccactgctcagctcactattgaagct gtgccatccaatgctgcagaggggaaggaggttcttctacttgtccacaatctgccccag gaccctcgtggctacaactggtacaaaggggaaacagtggatgccaaccgtcgaattata ggatatgtaatatcaaatcaacagattaccccagggcctgcatacagcaatcgagagaca atataccccaatgcatccctgctgatgcggaacgtcaccagaaatgacacaggatcctac accctacaagtcataaagctaaatcttatgagtgaagaagtaactggccagttcagcgta catcatcattccttggactctgctctatctttagaggtcactgactcaacggagactccc aagccctccatctccagcaacaactccaaccccgtggaggacaaggatgctgtggccttc acctgtgaacctgagactcagaacacaacctacctgtggtgggtaaatggtcagagtctc ccggtcagtcccaggctgcagctgtccaatggcaacaggaccctcactctactcagtgtc acaaggaatgacgtaggaccctatgaatgtgaaatacagaacccagcgagtgcaaacttc agtgacccagtcaccctgaatgtcctctatggcccagatgcccccaccatttccccttca gacacctattaccatgcaggggtaaatctcaacctctcctgccatgcggcctctaatcca ccctcacagtattcttggtctgtcaatggcacattccagcaatacacacaaaagctcttt atccccaacatcactacaaagaacagcggatcctatgcctgccacaccactaactcagcc actggccgcaacaggaccacagtcaggatgatcacagtctctggaacattggtggagcta gaggacattatgctcagcaaactaacgcagaaacagaaaaacaaatactgcatgttctca cttataaaaaaaaatgccccaggccttcctgtgggggccttcactggcatcgtgaccagg gttctggtcggggtggcaccggtggccaccctggcatgtttcctgctcctcgtcaggact ggaaggccccgctacccgaccccaggacagccgctcccatctatgaggcatcacagcagg cagcacatggatggaaagagactgcgggaaaagctgcctgtattagtccgttccgaagct gctataaagaactgctggagactgggagcccacatcagcctgtccacctgccttacagcc aaagccatcagccacttgctcccttccaccctccccaaccctgtacatggaggtgacggc ccctcctggaacacccaccactcactgtcagccttcagtggtaatcacaggttctacagg ggtcacctgtggaaccagaactcttggaaggacgagcagagggagctgctcaggacatct tggggctcctggcatctggtccaccacatggacataatcaatagcacttag >gi568815579r:42483249_42694828|GENSCAN_predicted_peptide_3|370_aa MKASLFTFWNPPTTAQVTVEVLPSNAVEEKEVVLLVYNLPQDILGHNCGQETVYPKARLE IWNIALNDSGSYTVQILKADLMTEKVTAQISVHPKLSKPFVTISNSNPVENEDTAALTCE TEIQNTTYLWWVNGQSLPVSPRLKLSNDNRTLTLPRVTRKDAGTYECEIRNLYLLFLCSP GCQPKSTWPEARPLSPSQVQVQRPLLLGTQAGHDFLPQANLDGPDDPNISPSYSYYHPGE NLNLSCNTASNPPPQYSWQINGKFQKSAQELFISKITKKNSGIYICFVRNSATDSRCFTI KKITVTGISVQGSEEMAEASEPKDGERGSNPGAAPWNESRWGHTENVVQEGIPRNGQERS RGHRLCGSAA >gi568815579r:42483249_42694828|GENSCAN_predicted_CDS_3|1113_bp atgaaagcctcacttttcaccttctggaacccacccaccactgcccaagttacagttgaa gttctgccatctaatgcggtagaggagaaggaggttgttctacttgtctacaatctgccc caggatatacttggccacaactgtggccaagagacagtataccccaaagcacgcctggag atctggaacatcgccctgaatgactcaggatcctacaccgtgcaaattttaaaggcagat cttatgactgaaaaagtaactgcacagatcagcgtacacccgaaactatccaagcccttc gtcaccatcagcaactccaaccccgtggagaatgaggatactgcggccttaacctgtgaa actgagattcagaacacaacttacctgtggtgggtaaatggtcagagcctcccggtcagt cccaggctgaagctgtccaatgacaacaggaccctcactctacccagagtcacaaggaag gacgcaggaacatatgaatgtgaaatccggaatctatatcttctgttcctctgtagccca ggctgccagcccaaatccacatggccagaggccaggcctctcagtccctctcaggtccaa gtacagagacctttactcctgggcacccaggctggccatgacttcctgccccaggcaaac ctggatggcccagacgaccccaacatttccccctcttactcctattaccatccaggggaa aacctcaacctctcctgcaacacagcctctaacccacccccacagtattcttggcagatt aatgggaagttccaaaaatcagcacaagagctctttatctccaaaatcactaaaaagaat agtggaatctacatctgtttcgttcgcaactcagccactgacagcaggtgtttcacgatc aagaagatcacagtcactgggatcagtgttcagggctcagaagaaatggcagaggccagt gagcccaaagatggtgagaggggcagcaacccaggggcagctccctggaacgagagcagg tggggccacacggagaacgtggtgcaggaaggaatacccaggaatgggcaggagaggagt cgaggacacaggctctgtgggtctgcagcctag >gi568815579r:42483249_42694828|GENSCAN_predicted_peptide_4|353_aa MEIQQESSICKKALTRTKSADTSTLDYPASRTVSFSINLAGSSCPQQHQRPHEKSSSRGS ACMGPMRQPQCQSLDGSDQETINFYACLLIHNVTQNDTGFYTLEVIKLNLMNEEITIQFS LRPELSKPSVTSNNSNPMEDKDAVALTCEPETQGTTYLWWVNGQSLPASSRLQLSNNNRT LTVLNVTRNYTGPYKCEIWNRVSVSHSYPVTLDVLCSPGCQPKLTWPESRPLSPSQVQEQ RLSALDIKAVHDCLPQANFEGHCGLSTDQEYSWLINEKLQQYTQELFIPKITAKNSGVYA CFVRNSATDLSFSTYKEGVTVAKKDVHMPKHPELADKNVPNLHVMKAMQSLKS >gi568815579r:42483249_42694828|GENSCAN_predicted_CDS_4|1062_bp atggagatacagcaagaaagcagcatctgcaagaaagccctcaccaggaccaaatcagct gacacctcaaccttggactacccagcttcccgaactgtgagtttctccatcaacttagct ggcagctcctgtccccagcagcatcagaggccccatgaaaagagctccagcaggggctca gcctgcatgggtcccatgcggcagcctcagtgtcagagcctggatggcagtgatcaggag acgataaacttctatgcatgcctgctgatccataacgtgacccaaaatgacacaggattc tacacccttgaagttataaagctaaatcttatgaatgaagaaataaccatacaattcagc ttacgcccagagctgtccaagccctccgtcaccagcaacaactccaaccccatggaggac aaggatgctgtggccttaacttgtgaacctgagacccagggcacaacctacctgtggtgg gtaaatggtcagagcctcccggccagttccaggctgcagctgtccaataacaacaggacc ctcactgtactcaatgtcacaaggaattacacaggaccctataagtgtgaaatctggaat cgagtgagtgtcagtcacagttaccctgtcaccctggatgtcctctgtagcccaggctgc caacccaaattgacatggccagaatccaggcctctcagtccttctcaggtccaagaacag agactctcagccctggatatcaaagctgtccatgactgcctgccccaggcaaattttgag ggacactgtggcctttctacagaccaggagtattcttggcttatcaatgaaaagctccag caatacacacaagagctctttatccccaaaatcactgcaaagaatagcggggtctatgct tgttttgtccgtaactcagccactgacctcagtttttctacatacaaggaaggagtcacg gtggctaagaaggatgtccacatgcctaagcacccggagctggcagacaagaatgtgccc aaccttcatgtcatgaaggccatgcagtctctcaagtcctga >gi568815579r:42483249_42694828|GENSCAN_predicted_peptide_5|170_aa MDKFLDTYTLPSLNQEEVESLKRPITSSEIEAVVNSLLSKKSPGPDGFTAKFCQRYKEEL IPFLLKLFQAIEKQGLLPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILP NRIQQHIKKLIHHDQVGFIPGIQGSFNICKSIHIIHYINKTHDKNHRLSQ >gi568815579r:42483249_42694828|GENSCAN_predicted_CDS_5|513_bp atggataaattcctggacacatacaccctcccaagtctaaaccaggaagaagtcgaatcc ctgaagagaccaataacaagttctgaaattgaggcagtagttaatagcctactaagcaaa aaaagcccaggaccagatggattcacagccaaattctgccagaggtacaaagaggagctg ataccattccttctgaaactattccaagcaatagaaaaacagggactcctccctaactca ttttatgaggccagcatcatcctgataccaaaacctggcagagacacaaccaaaaaagaa aatttcaggccaatatccctgatgaacatcgatgcaaaaatcctcaataaaatattgcca aaccgaatccagcagcacatcaaaaagcttatccaccacgatcaagtcggcttcatccct gggatacaaggctcattcaacatatgcaaatcaatacacataatccattacataaacaaa acccatgacaagaaccacagattatctcaatag