GENSCAN 1.0	Date run: 3-Nov-116	Time: 03:48:22

Sequence gi568815596r:72975543_73212777 : 237235 bp : 48.00% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4566   4683  118  2  1   52  106    97 0.722   6.26
 1.02 Intr +   5406   5467   62  1  2   91   80    32 0.478   1.15
 1.03 Term +   8168   8296  129  1  0   79   54   106 0.680   4.48
 1.04 PlyA +   8460   8465    6                              -1.75

 2.14 PlyA -  10566  10561    6                               1.05
 2.13 Term -  11331  11162  170  1  2   66   40    54 0.106  -3.56
 2.12 Intr -  12806  12716   91  2  1   80   82    73 0.644   5.47
 2.11 Intr -  23472  23407   66  0  0   95   68    84 0.773   6.20
 2.10 Intr -  24945  24889   57  0  0   97  105    87 0.952  10.38
 2.09 Intr -  26036  25983   54  2  0  121   92    24 0.786   5.28
 2.08 Intr -  27922  27797  126  1  0   51   98    36 0.616   1.78
 2.07 Intr -  30637  30438  200  2  2   86   51    95 0.827   4.67
 2.06 Intr -  47034  46980   55  0  1   96  115    58 0.983   7.85
 2.05 Intr -  47672  47646   27  2  0  108   89     8 0.689   1.21
 2.04 Intr -  65389  65312   78  1  0  135   93    81 0.804  13.05
 2.03 Intr -  83054  82986   69  2  0  112  107   133 0.740  16.98
 2.02 Intr -  92977  92888   90  1  0   70  105    10 0.001   1.09
 2.01 Init -  96163  96062  102  1  0  107   94   141 0.001  14.99
 2.00 Prom -  97734  97695   40                              -6.26

 3.04 PlyA -  97861  97856    6                               1.05
 3.03 Term - 100182  99979  204  0  0   98   38   491 0.999  42.47
 3.02 Intr - 100640 100451  190  1  1  115  116   138 0.999  18.89
 3.01 Init - 106012 104526 1487  1  2   41    6   679 0.597  44.38
 3.00 Prom - 106466 106427   40                             -12.30

 4.05 PlyA - 106504 106499    6                              -0.45
 4.04 Term - 107137 107038  100  0  1   94   45   122 0.988   6.10
 4.03 Intr - 113207 112508  700  0  1  121  111   539 0.989  50.04
 4.02 Intr - 113773 113337  437  0  2  136   72   659 0.978  62.32
 4.01 Init - 121855 121803   53  1  2   56   72    59 0.014   1.73
 4.00 Prom - 123789 123750   40                              -2.26

 5.02 PlyA - 128184 128179    6                               1.05
 5.01 Sngl - 137235 136801  435  0  0  106   48   904 0.785  82.47
 5.00 Prom - 138317 138278   40                              -0.36

 6.00 Prom + 146632 146671   40                              -4.66
 6.01 Init + 151190 151361  172  1  1   41   95    80 0.557   3.60
 6.02 Intr + 154204 154351  148  2  1   89   44    54 0.045   0.39
 6.03 Intr + 163502 163546   45  2  0  115   81    15 0.099   1.02
 6.04 Intr + 167758 167831   74  1  2   71   50   100 0.109   3.55
 6.05 Intr + 188075 188155   81  0  0   58   86    38 0.246   0.21
 6.06 Intr + 193502 193656  155  0  2   38   80    98 0.128   3.79
 6.07 Term + 203218 203331  114  0  0   70   42    83 0.195   0.47
 6.08 PlyA + 203397 203402    6                               1.05

 7.00 Prom + 206224 206263   40                              -3.06
 7.01 Init + 227125 227506  382  0  1   80   84   304 0.991  24.23
 7.02 Intr + 232858 233072  215  2  2  104   75   112 0.594   9.93
 7.03 Term + 235229 235387  159  1  0   90   49   249 0.999  19.14
 7.04 PlyA + 235773 235778    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  48941  48855   87  2  0   86   64    74 0.945   3.44
S.002 Intr +  95417  95615  199  1  1  108   37   119 0.872   7.72
S.003 Intr +  95795  95966  172  0  1   54   43    84 0.888  -0.50
S.004 Term +  96225  96447  223  0  1   87   33   248 0.989  15.59
S.005 Init + 117958 118042   85  0  1   70   89    54 0.821   5.00
S.006 Term + 167758 167845   88  1  1   71   45   129 0.858   4.03
S.007 Init + 181230 181275   46  2  1   77   55    73 0.808   1.87
S.008 Term + 181484 181746  263  0  2   82   48   132 0.928   4.29

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_1|102_aa
MLASPPLFSLLSSLPLFLPSTVVGNTVNDSNGVKGEIIKRIIKPQNWKAIYATSTYGAPT
APAPPIAVNEMSIPLATATSGTQDLALLPAGNCSWYPSSKAP

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_1|309_bp
atgctggcctcccctcccctcttctctcttctctcctccctccctctcttccttccttcc
accgttgtgggcaacacagtcaatgattcaaatggtgtaaaaggggaaataataaaaaga
attataaaaccccagaactggaaggccatttacgcaacaagcacttatggagcacctacg
gcccctgccccacccattgctgtcaacgagatgtctattcccctggccactgccacctct
gggacccaggacttggccctccttcctgctggtaactgctcctggtacccctcctccaag
gccccttga

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_2|394_aa
MADTATTASAAAASAASASSDAPPFQLGKPRFQQGSYKIASGVMFDSVRFISAETSQSWG
VSKETSFYGRFRHFLDIIDPRTLFVTERRLREAVQLLEDYKHGTLRPGVTNEQLWSAQKI
KQAILHPDTNEKIFMPFRMSGICQGCLLVATSDAQQGKQEGMRPLSQGSELTRCHVLPRA
VSQSKLDDQAEPKSEEINSFCDEAVARRIIKGTLWCHGQCPGQQLSGKCSECYWAVSYDV
PGGPSTVRGVVGLLLPNQTLASTVFWQWLNQSHNACVNYANRNATKPSPASKFIQGYLGA
VISAVSIAVGLNVLVQKANKFTPATRLLIQRFVPFPAVGDCGRTGICGGDFSCVFAELGV
YELTIDAPGWGPGDATKLCLALKKPTPCLGRRAK

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_2|1185_bp
atggcggatacagcgactacagcatcggcggcggcggctagtgccgctagcgcctcgagc
gatgcacctcctttccaactgggcaaaccccgcttccagcagggatcttacaaaattgca
agtggtgtcatgtttgacagtgtaaggttcatcagtgctgagacatcccagagttggggt
gtctctaaggagacgtccttctatggccgcttcaggcacttcttggatatcatcgaccct
cgcacactctttgtcactgagagacgtctcagagaggctgtgcagctgctggaggactat
aagcatgggaccctgcgcccgggggtcaccaatgaacagctctggagtgcacagaaaatc
aagcaggctattctacatccggacaccaatgagaagatcttcatgccatttagaatgtca
gggatttgtcagggatgtcttctcgtggcaacctcagatgctcagcagggcaagcaggaa
ggcatgaggcctctgagccagggctcagaactgacacgctgccacgtcctcccacgtgct
gtcagtcagagcaagttagatgaccaagcagagccaaaaagtgaggaaataaattccttc
tgtgatgaggccgtggcaaggaggattataaagggaaccctgtggtgtcacgggcagtgt
cctggacagcagttgtctggtaaatgcagtgagtgctattgggctgtatcttacgatgtt
cctggtggtccaagtacagttcgaggggtagtcggtcttctcttgcccaaccagacactg
gcatccactgtcttctggcagtggctgaaccagagccacaatgcctgtgtcaactatgca
aaccgcaatgcgaccaagccttcacctgcatccaagttcatccagggatacctgggagct
gtcatcagcgccgtctccattgctgtgggccttaatgtcctggttcagaaagccaacaag
ttcaccccagccacccgccttctcatccagaggtttgtgccgttccctgctgtaggtgat
tgtgggcgcacaggtatatgtggaggggatttcagctgcgtctttgctgaacttggagtc
tatgagctcaccatagatgccccaggctggggccctggagatgccacaaaactctgcttg
gcccttaagaagcctacaccttgtttggggagaagggcaaaatag

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_3|626_aa
MLSTNLFAAASPAAATAAAAATTAAPEATPPGLLGLTNPFLTSLQSNPFFEELIADIALN
SPSPAPSLPSASRASPTPLASPGKALPEWDNTFNVFAASRLRPEARSEILAPAGVGLEAA
GLQDPGPGAMTAKAAEPQGEPGGGGGGGGGGGGRGGSSVWLEPRVPLDLGPNHQSASAAD
PGLLGSVGAGLPSSSAQLQLRASGSEPDRELPAPEVEAGQSPADSGTSLFSSPEVISVWE
RLPGPESAAEGQDDESSRGENQLCPDVETADDAWPWDVVTISPAAETASLVFRGESDEPA
PQVQPESPETVSPKGSEGLPPPEPEPKPEWVSDKGLQPSTPPPKPPRLFTPSRSQEEEEE
KAAVGLSNRGPETEGEDASPSALVVGPPETKEEGEKRESEESDSCSSATLLGQPGLEELV
EDASPPVSGPCLSAPASCPEGPAPIPCHSKSLALQSQHIWGTPEVGEGPEAPEAQGQDPV
GEGLGSLSATSQQADVPHPVKPLSAAPVEGSPDRKQSRSSLSIALSSGLEKLKTVTSGSI
QPVTQAPQAGQMVDTKRLKDSAVLDQSAKYYHLTHDELISLLLQRERELSQRDEHVQELE
SYIDRLLVRIMETSPTLLQIPPGPPK

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_3|1881_bp
atgctaagcactaacctttttgcagccgcctcccccgctgctgccactgctgccgctgcc
gccaccaccgccgcccctgaagccaccccacctggattattgggtcttaccaacccgttc
ctcacctctttgcagagcaaccccttcttcgaggagctcatagccgacatagcactaaac
tctccttcacctgctccctctctccccagtgcctcgagggccagccccacccccctggcc
tcccctgggaaagccctgcctgagtgggacaacaccttcaacgtctttgctgccagcagg
ctgcgtccagaggccaggagcgagatcctggcccctgcaggagtggggctggaggcggca
gggctgcaagacccaggccctggggccatgactgcgaaggcagctgagccccagggagag
cctgggggaggaggaggaggaggaggaggaggaggaggaagaggtgggagcagcgtgtgg
ctggagcccagggttcctctggacttgggaccgaaccaccagagcgcgagcgcggctgac
ccagggctcctcgggtcggtaggggctggcctgccctcctcgtcagcccagctacagctg
agagcctcaggctcagaaccagacagggaactgccagccccggaagtggaagcagggcag
agtccggcagacagtgggacatctctatttagctccccagaagtgatcagtgtgtgggag
aggctgccgggcccagagagcgctgctgagggccaggacgatgagtcctcccgaggcgaa
aatcagctttgccctgacgtcgagacagctgatgatgcctggccttgggatgtggtcacc
atttctcctgcagctgagacagcctcactagtctttcggggagagtctgatgagcctgct
ccccaggtgcagcctgaatcaccagaaactgtgagccccaaggggagcgaggggcttccc
ccaccggagcccgagcctaaacccgagtgggtgtctgacaaggggctgcagcccagcacc
ccacctcccaagccaccgcgcctcttcacaccctcaagatcccaggaggaggaggaggag
aaggccgcagtggggctgagtaacagggggccggagacagagggagaagatgcctcccca
agtgcactggttgtcggtcccccggagaccaaggaggagggagagaagcgtgagtcggag
gagtctgacagctgctcctctgcaaccctgctgggccagcctggcctggaagagctagtg
gaggatgccagcccccctgtgtctgggccctgcctgtctgcacccgccagctgccctgag
ggtcctgcccccataccctgtcactcaaagagcttggctcttcagagtcagcacatctgg
gggactccagaggttggagaaggccctgaggctcctgaggcccagggccaggatccagta
ggagaggggcttgggtccttgtcagccacctcccagcaggctgatgttccccaccccgtg
aagcccctcagtgccgcccctgtggagggcagccccgacaggaagcagtcccgctccagt
ctgagcatagccctgagcagtgggctggagaagctcaaaacagtcacatctgggagcatt
cagcctgtgacccaggccccccaggctggccagatggtggacaccaaaaggctgaaggac
tcagctgtgctggaccagtcggccaagtactaccacctgacccacgatgagctcatcagc
ctgctcctgcagcgggagcgggagctgagccagcgggacgagcatgtgcaggagctggag
agctacatcgaccggctgctggtgcggatcatggagacctcacccacgctgctgcagatc
cccccgggcccccccaaatag

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_4|429_aa
MRVHTPVPACDQYDDERGWYKLHSKPGKKEKERGEIEVTIQFTRNNLSASMFDLSMKDKP
RSPFSKIRDKMKGKKKYDLESASAILPSSAIEDPDLGSLGKMGKAKGFFLRNKLRKSSLT
QSNTSLGSDSTLSSASGSLAYQGPGAELLTRSPSRSSWLSTEGGRDSAQSPKLFTHKRTY
SDEANQMRVAPPRALLDLQGHLDAASRSSLCVNGSHIYNEEPQGPVRHRSSISGSLPSSG
SLQAVSSRFSEEGPRSTDDTWPRGSRSNSSSEAVLGQEELSAQAKVLAPGASHPGEEEGA
RLPEGKPVQVATPIVASSEAVAEKEGARKEERKPRMGLFHHHHQGLSRSELGRRSSLGEK
GGPILGASPHHSSSGEEKAKSSWFGLREAKDPTQKPRVPEKPLVTSRVAAAVTLPALADK
GWTIRAPFG

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_4|1290_bp
atgcgtgtgcacacaccagtgccagcttgtgatcagtatgatgatgaaagagggtggtac
aagctgcactccaagccaggcaagaaggagaaggaacgcggcgagattgaagtcaccatc
cagttcacgcgcaacaacctgagcgccagtatgtttgacctgtccatgaaggacaagcca
aggtctcccttcagcaagatcagggacaagatgaagggcaagaagaagtatgatctggaa
tctgcctctgccatcctcccaagcagcgccatagaggatcctgacctgggcagcctgggc
aagatgggcaaagccaaaggcttcttcctccgcaacaagctgcgcaagtcgtccctgacc
cagtccaacacctcgctgggctcggacagcaccctgtcctcagccagcgggagcttggcc
taccagggacctggcgccgaactcctcacccgctcaccaagccgtagcagctggctgtcc
actgaagggggcagggactctgcacagtcccccaagctgttcacccataagaggacctac
agcgatgaggccaaccagatgcgagtggctcctcctcgggcccttctggaccttcagggc
cacctggatgctgcctcccgctcttcgctctgtgtcaatgggagccacatttacaatgag
gagccccagggccctgtgcggcaccgcagctccatctcgggctcgcttccatcctctggc
tccttgcaagctgtctcttcccggttctccgaggaggggcctcgttccacagatgacacc
tggcccagaggcagtcgtagcaacagcagctcagaggcagtgcttggacaggaggagctg
agtgctcaggctaaagtcctggcccctggggccagccaccctggagaggaggagggggcc
cggctaccagagggcaagccagtccaggttgccacacccatagtggcctcctctgaggct
gtggcagagaaggagggagcccggaaggaggaacgcaagccccggatgggtctcttccac
caccaccaccaaggcctaagtcggagcgagttgggtcgccgaagctctctgggggaaaag
gggggtcccatcctgggggcctccccacatcactcatccagtggggaggaaaaggccaag
agtagctggtttggcttgagagaagccaaggacccgactcagaaacccagagtccctgag
aagcccctggtgacatccagagttgctgctgctgtcacactgccagccttggcagacaaa
ggctggaccatcagggcaccttttggttga

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_5|144_aa
MALVRGAEPAAGPSRWLPTHVQVTVLRARGLRGKSSGAGSTSDAYTVIQVGREKYSTSVV
EKTHGCPEWREECSFELPPGALDGLLRAQEADAGPAPWAASSAAACELVLTTMHRSLIGV
DKFLGQATVALDEVFGAGRAQHTQ

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_5|435_bp
atggccctggtgcggggcgcggagccggcggcggggccttcccgctggctgcccacgcac
gtccaggtgacggtgctgcgggcccgcgggctgcggggcaagagctcgggagcgggcagc
accagcgacgcgtacacggtgatccaggtgggccgcgagaagtacagtacgtcggtggtg
gagaagacgcacggctgccccgagtggcgtgaggagtgctccttcgagctgccgccgggg
gccctggatggcctgctgcgggcgcaggaggccgacgcgggcccggcgccctgggccgcg
agctccgccgccgcctgcgagctggtgctcaccaccatgcaccgctcgctcatcggcgtc
gacaagttcctgggccaggccacggtggcgctggacgaggtcttcggcgcaggccgcgcc
cagcacacgcagtga

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_6|262_aa
MGWTQRLTFKKEYENQQFYNFTMAKLGRPHLHQVVQINSTSDKSHGQNVPLICSHEKDCK
LHESRNHTCLADLHIFSFHHCARHTAGAQEIFVDLRKEQKQQNMPERGQRDGAENSNPLI
TRDLRCPEFIPMDILTSEWSEPLQFCHEETDTERLRNLPKATQPVVRTWVLDLGGGNSVR
MQATLRDQDSKGYGSSRRAAVPTGCSILKLPRGSPECKLTGMHIQGSPINSQNCLSVAVS
PEQQVLSFVALLTKNWTQPNIH

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_6|789_bp
atgggctggactcagagactcaccttcaaaaaagagtatgaaaaccagcaattttacaat
tttacaatggcaaagcttggcaggccccaccttcatcaggtggtccagattaacagcacc
agtgataagtcccatggacagaatgtacccctgatatgcagccatgagaaagactgtaag
ctccatgagagcaggaaccacacctgcctggctgacctccacatcttcagcttccaccac
tgtgcccggcacacagcaggtgctcaagaaatatttgttgacttaagaaaggaacaaaaa
caacaaaacatgcccgaaagaggtcaaagagatggggctgaaaattccaaccctctaatt
acaagggatctacgctgccctgagttcatccccatggacatcctgaccagcgagtggtct
gagcctctgcagttctgccatgaggaaacagacacagagaggttaaggaacttgcccaag
gccacacagccagtagtgaggacttgggttttggacctgggtggaggaaacagtgtaaga
atgcaggctacactgcgtgaccaggattccaaaggctacggaagttctaggagggcagct
gttccaactggctgtagcattctgaagcttcccagaggctccccagagtgcaagctaaca
ggtatgcacattcagggatctccaatcaactcccagaactgcctgagtgtggcggtcagc
ccagagcagcaggtgctatcctttgtagcactgttaaccaaaaactggacacaaccaaat
atccactaa

>gi568815596r:72975543_73212777|GENSCAN_predicted_peptide_7|251_aa
MPSPRPRGSPPPAPSGSRVRPPRSGRSPAPRSPTGPNTPRAPGRFESPFSVEAILARPDP
CAPAASQPSGSACVHPAFWTAASLCATGGLPWACPTSWLPAYLSVGFYPVPGPRVAPVCG
LLGFGVTGLELAHCSGLWAFPDWAPTEDLQDTERQQKRVRTMFNLEQLEELEKVFAKQHN
LVGKKRAQLAARLKLTENQVRVWFQNRRVKYQKQQKLRAAVTSAEAASLDEPSSSSIASI
QSDDAESGVDG

>gi568815596r:72975543_73212777|GENSCAN_predicted_CDS_7|756_bp
atgcctagccccaggccgcgaggcagcccgccacccgctccctcgggctctcgggtccga
cctccgcgctctggccgctctccggcgcccaggtcccctactggcccgaacacgccccgc
gctcccggacgcttcgagtcccctttctcggtcgaggccatcctggcgaggcccgacccc
tgcgcgccggcggcctcccagccgtcgggctccgcctgcgtccacccggccttctggacc
gctgcttccctgtgcgccaccgggggtctgccctgggcttgcccgacatcgtggctgccc
gcctacctgagcgtaggtttttaccctgtgccagggccgcgcgtggctcccgtctgcggc
ctgctgggcttcggcgtcacagggttggagctggctcactgctcaggactctgggccttc
ccagactgggccccaacggaggacctacaggacactgagagacagcaaaagagagtccga
actatgtttaacttggagcagctggaagagttggagaaagtgtttgcaaaacagcacaat
ctggtggggaagaagagagcccagctggcagctcggctcaaacttacagagaaccaggtg
agagtctggttccagaaccgcagggtcaagtatcagaagcagcaaaagctgagggcagca
gttacatctgccgaggctgcctccctggatgagccttccagcagctccatcgccagtatc
cagagtgatgatgccgagtcaggagtggacggctga