GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:58:02 Sequence gi568815581r:37599076_37844884 : 245809 bp : 44.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5770 5811 42 2 0 78 82 54 0.122 2.01 1.02 Term + 10094 10419 326 0 2 66 41 144 0.686 2.43 1.03 PlyA + 11916 11921 6 1.05 2.16 PlyA - 12952 12947 6 1.05 2.15 Term - 15278 15221 58 1 1 112 42 52 0.597 0.16 2.14 Intr - 19309 19217 93 0 0 79 98 37 0.506 2.88 2.13 Intr - 21873 21798 76 1 1 72 94 -17 0.480 -4.13 2.12 Intr - 22202 22052 151 2 1 49 95 42 0.672 0.74 2.11 Intr - 22445 22323 123 2 0 90 103 32 0.756 5.68 2.10 Intr - 25359 25269 91 2 1 55 71 77 0.924 2.70 2.09 Intr - 27023 26820 204 1 0 59 63 146 0.868 7.52 2.08 Intr - 27785 27713 73 0 1 82 75 11 0.848 -2.34 2.07 Intr - 29597 29486 112 2 1 83 87 54 0.997 4.75 2.06 Intr - 31098 30955 144 0 0 61 80 86 0.971 5.58 2.05 Intr - 33223 33038 186 1 0 67 84 127 0.998 10.09 2.04 Intr - 36565 36476 90 1 0 5 127 75 0.463 3.29 2.03 Intr - 42987 42944 44 1 2 90 36 33 0.548 -3.74 2.02 Intr - 43233 43035 199 0 1 65 92 85 0.635 5.52 2.01 Init - 44345 44259 87 2 0 87 84 85 0.963 8.70 2.00 Prom - 70055 70016 40 -2.96 3.00 Prom + 72736 72775 40 -6.26 3.01 Init + 79569 79704 136 2 1 13 116 121 0.737 7.70 3.02 Intr + 80474 81200 727 0 1 -32 53 273 0.436 2.29 3.03 Intr + 81959 81993 35 2 2 50 92 38 0.496 -1.73 3.04 Intr + 87942 88171 230 1 2 80 115 112 0.469 10.69 3.05 Term + 90877 90906 30 0 0 98 38 40 0.517 -2.15 3.06 PlyA + 92400 92405 6 1.05 4.13 PlyA - 96690 96685 6 1.05 4.12 Term - 97922 97816 107 0 2 78 55 44 0.287 -1.33 4.11 Intr - 98250 98190 61 0 1 108 61 -7 0.185 -3.09 4.10 Intr - 100119 100001 119 1 2 88 95 78 0.935 8.68 4.09 Intr - 102102 101908 195 1 0 126 70 214 0.999 22.79 4.08 Intr - 105974 105842 133 2 1 93 106 -7 0.572 1.82 4.07 Intr - 111588 111428 161 1 2 100 101 121 0.977 14.31 4.06 Intr - 128240 128112 129 0 0 55 77 90 0.221 5.27 4.05 Intr - 130636 130547 90 2 0 86 62 44 0.232 1.57 4.04 Intr - 132755 132520 236 1 2 126 115 242 0.999 28.03 4.03 Intr - 134746 134482 265 2 1 115 92 294 0.994 29.07 4.02 Intr - 140564 140365 200 1 2 69 105 263 0.530 25.09 4.01 Init - 145809 145466 344 0 2 67 96 804 0.998 73.81 4.00 Prom - 149340 149301 40 -5.46 5.05 PlyA - 152078 152073 6 1.05 5.04 Term - 154097 153919 179 0 2 104 53 82 0.483 4.15 5.03 Intr - 155729 155672 58 2 1 52 66 56 0.419 -1.74 5.02 Intr - 156501 156388 114 0 0 39 90 67 0.234 2.64 5.01 Init - 163773 163684 90 0 0 54 56 151 0.886 6.99 5.00 Prom - 171467 171428 40 -1.76 6.02 PlyA - 174111 174106 6 1.05 6.01 Sngl - 184365 183640 726 0 0 60 35 161 0.924 4.25 6.00 Prom - 185046 185007 40 -3.26 7.04 PlyA - 187608 187603 6 1.05 7.03 Term - 190968 190947 22 2 1 142 38 27 0.175 0.88 7.02 Intr - 196257 196185 73 1 1 57 100 47 0.475 1.26 7.01 Init - 197314 197197 118 1 1 91 46 57 0.697 2.22 7.00 Prom - 198507 198468 40 -4.76 8.04 PlyA - 198930 198925 6 1.05 8.03 Term - 226446 226378 69 0 0 104 47 50 0.335 0.44 8.02 Intr - 229889 229687 203 0 2 95 33 133 0.492 7.50 8.01 Init - 231412 231361 52 1 1 74 80 27 0.846 1.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 179584 179449 136 1 1 70 101 72 0.827 7.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_1|122_aa XNMEALGEEDRGTAAQFQGKGSGLEKTAITAKRPSPPPRRPAGSRPALHTYPPPPADPAP APAAPPPPEPAPGRSAILLPTCRCLRRRHLISSCQLNTATSGSNTRPHLAADRAMGRGLA PT >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_1|369_bp nggaatatggaggcccttggagaggaagaccggggaacagctgcccagttccaaggaaaa ggatcagggctggagaaaactgccataactgccaagcgcccctcgccgcccccgcggagg ccagcgggctcccgcccggctcttcacacctacccgcctcccccggcggaccccgcgcca gctcccgcggccccgccgccaccagaaccagctcctggccgcagcgccatcttgctcccg acctgccgctgccttcgccgccgccaccttatcagcagctgtcagctgaacacagccact tccgggtcaaacaccaggccccacctcgccgcggaccgggcgatggggcggggcctcgcg cccacgtga >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_2|576_aa MDVHDLFRRLGAGAKFDTRRFSADAARFQIGKRKYDFDSSEVLQGLDFFGNKKSVPGVCG ASQTHQKPQNGEKKEESLTERKREQSKKKRKTMTSVIKIIQAYGYKSDNKPKAGNDPNVP HQVNDELYIHYNGILPSNKTINFLRNKHKIHVQGTDLPDPIATFQQLDQEYKINSRLLQN ILDAGFQMPTPIQMQAIPVMLHGRELLASAPTGSGKTLAFSIPILMQLKQPANKGFRALI ISPTRELASQIHRELIKISEGTGFRIHMIHKAAVAAKKFGPKSSKKFDILVTTPNRLIYL LKQDPPGIDLASVEWLVVDESDKLFEDGKTGFRDQLASIFLACTSHKVRRAMFSATFAYD VEQWCKLNLDNVISVSIGARNSAVETVEQELLFVGSETGKLLAVRELVKKGFNPPVLVFV QSIERAKELFHELIYEGINVDVIHAERTQQQRDNTVHSFRAGKIWVLICTALLARGIDFK GVNLVINYDFPTSSVEYIHRIGRTGRAGNKGKAITFFTEDDKPLLRSKQKKKMIKKPLER ESISTTPKCFLEKAKDKQKKVTGQNSKKKVALEDKS >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_2|1731_bp atggacgtccacgatctctttcgccggctcggcgcgggggccaaattcgacacgagacgc ttctcggcagacgcagctcgattccagataggaaaaaggaaatatgactttgattcttcg gaggtgcttcagggactggacttttttggaaacaagaagtctgtcccaggtgtgtgtgga gcatcacaaacacatcagaagccccaaaatggagagaaaaaagaagagagcctaactgaa aggaagagggagcagagcaagaaaaaaaggaagacgatgacttcagttataaaaataata caggcttatggctacaagtcagacaataagccaaaagctggaaatgacccaaatgtccct catcaggtgaatgatgaactgtacatccattacaatggaatactacctagtaataaaacg ataaacttcttgcggaataaacacaaaattcacgtccaaggaaccgatcttcctgaccca attgctacatttcagcaacttgaccaggaatataaaatcaattctcgactacttcagaac attctagatgcaggtttccaaatgcctacgccaatccaaatgcaagccatcccagttatg ctgcatggtcgggaacttctggcttctgctccaactggatctggaaaaacattagctttt agcattcctattttaatgcagctgaaacaacccgcaaataaaggcttcagagccctgatt atatcaccaacacgagaacttgccagccagattcacagagagttaataaaaatttctgag ggaacaggattcagaatacacatgatccacaaagcagcagtggcagccaagaaatttgga cctaaatcatctaaaaagtttgatattcttgtgactactccaaatcgactaatctattta ttaaagcaagatccccccggaatcgacctagcaagtgttgagtggcttgtagtagacgaa tcagataaactgtttgaagatggcaaaactgggttcagagaccagctggcttccattttc ctggcctgcacatcccacaaggtccgaagagctatgttcagtgcaacttttgcatatgat gttgaacagtggtgcaaactcaacctggacaatgtcatcagtgtgtccattggagcaagg aattctgcagtagaaactgtagaacaagagcttctctttgttggatctgagaccggaaaa cttctggccgtgagagaacttgttaaaaagggtttcaatccacctgttcttgtttttgtt cagtccattgaaagggctaaagaactttttcatgagctcatatatgaaggtattaatgtg gatgttattcatgcagagagaacacaacaacagagagataacacagtccacagtttcaga gcaggaaaaatctgggttctgatttgtacagccttgctagcaagagggattgattttaaa ggtgtgaacttggtgatcaactatgactttccaactagctcagtggaatatatccacagg ataggtcgaactggaagagcagggaataagggaaaagcaattacatttttcactgaggat gataagccattattaagaagcaaacaaaagaaaaagatgattaagaaaccattggaaagg gagagcattagtacaactccaaaatgtttcttagaaaaagctaaggataaacagaaaaag gtcactggtcagaacagcaagaagaaagtagctcttgaagacaaaagttaa >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_3|385_aa MYPYKREEEGDLTQKRRRCDHGGGDRSDVATSQELLTANSGWKNQAWYWYQNGDIDQWNR TEASEIIPQIYNYLIFDKPDKNKKWGKDSLFNKRCWENWLAIRRKLKLDPFLTPYAKINS RWIKDLHVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDNWDLIKLKSFCT AKETTIRVNGQPTERGKIFAIYSSDKGLISRIYNELKQIYKKKTNNPINKWMKDMNRHFS KENIYAANRHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGSNRRACFHFDFHHHI TDSSFRISLFDEGSQHRQYGFLASSSEVPCLPPPGQTGVLDIVGEALWQYCIEGKLGFGL RLLRVDCLRCQQDVRQALPPSEGFE >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_3|1158_bp atgtatccttataagagggaggaagagggagacttgacacagaagagaaggcgatgtgac cacggaggcggagataggagtgatgtggccacaagccaagaactgctgacagccaatagt ggctggaagaatcaagcgtggtactggtaccaaaatggagatatagaccaatggaacaga acagaggcctcagaaataataccacagatctacaactatctgatctttgacaaacctgac aaaaacaagaaatggggaaaggattccctattcaacaaacggtgctgggaaaactggcta gccatacgtagaaagctgaaactggatcccttccttacaccttatgcaaaaattaattca agatggattaaagacttacatgttagacctaaaaccataaaaaccctagaagaaaaccta ggcaataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagca atggcaacaaaagccaaaattgacaattgggatctaattaaactaaagagcttctgcaca gcaaaagaaactaccatcagagtgaacgggcaacctacagaaagggggaaaatttttgca atctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttac aagaaaaaaacgaacaaccctatcaacaagtggatgaaggatatgaacagacacttctca aaagaaaacatttatgcagccaacagacacatgaaaaaatgctcatcatcactggccatc agagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggcaatc attaaaaagtcaggaagcaacagacgtgcttgcttccacttcgacttccaccatcatata acagattcaagttttcgcatcagtttgttcgatgaaggatcacaacatagacagtacggc tttcttgcttcctcttcggaggttccttgtctcccacctccaggacagacaggagtcctt gacatcgtgggagaggcattgtggcaatactgcatagaagggaaactgggcttcgggctg cgcctcctgagagtggattgtctgaggtgccagcaggacgtccgtcaggctttacctcct agtgaaggctttgaataa >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_4|679_aa MVSKLTSLQQELLSALLSSGVTKEVLVQALEELLPSPNFGVKLETLPLSPGSGAEPDTKP VFHTLTNGHAKGRLSGDEGSEDGDDYDTPPILKELQALNTEEAAEQRAEVDRMLSEDPWR AAKMIKGYMQQHNIPQREVVDVTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREIL RQFNQTVQSSGNMTDKSSQDQLLFLFPEFSQQSHGPGQSDDACSEPTNKKMRRNRFKWGP ASQQILYQAYDRQKNPSKEEREALVEECNRAECLQRGVSPSKAHGLGSNLVTEVRVYNWF ANRRKEEAFRQKLAMDAYSSNQTHSLNPLLSHGSPHHQPSSSPPNKLSALRCVECLPSPG NMLVMYELRVLPSGRLLSALGQQELSLVIGSMTAHSEAIEQVWSDGLYLPSGDESRERPL KGVRYSQQGNNEITSSSTISHHGNSAMVTSQSVLQQVSPASLDPGHNLLSPDGKMISVSG GGLPPVSTLTNIHSLSHHNPQQSQNLIMTPLSGVMAIAQSLNTSQAQSVPVINSVAGSLA ALQPVQFSQQLHSPHQQPLMQQSPGSHMAQQPFMAAVTQLQNSHMYAHKQEPPQYSHTSR FPSAMVVTDTSSISTLTNMSSSKQTFQESGNRLISAMMRMIITLRTRVKNKFTCHIPLRK RSSNKLKEKEEWERHSEVK >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_4|2040_bp atggtgtccaagctcacgtcgctccagcaagaactcctgagcgccctgctgagctccggg gtcaccaaggaggtgctggttcaggccttggaggagttgctgccatccccgaacttcggg gtgaagctggagacgctgcccctgtcccctggcagcggggccgagcccgacaccaagccg gtcttccatactctcaccaacggccacgccaagggccgcttgtccggcgacgagggctcc gaggacggcgacgactatgacacacctcccatcctcaaggagctgcaggcgctcaacacc gaggaggcggcggagcagcgggcggaggtggaccggatgctcagtgaggacccttggagg gctgctaaaatgatcaagggttacatgcagcaacacaacatcccccagagggaggtggtc gatgtcaccggcctgaaccagtcgcacctctcccagcatctcaacaagggcacccctatg aagacccagaagcgtgccgctctgtacacctggtacgtcagaaagcaacgagagatcctc cgacaattcaaccagacagtccagagttctggaaatatgacagacaaaagcagtcaggat cagctgctgtttctctttccagagttcagtcaacagagccatgggcctgggcagtccgat gatgcctgctctgagcccaccaacaagaagatgcgccgcaaccggttcaaatgggggccc gcgtcccagcaaatcttgtaccaggcctacgatcggcaaaagaaccccagcaaggaagag agagaggccttagtggaggaatgcaacagggcagaatgtttgcagcgaggggtgtccccc tccaaagcccacggcctgggctccaacttggtcactgaggtccgtgtctacaactggttt gcaaaccgcaggaaggaggaggcattccggcaaaagctggccatggacgcctatagctcc aaccagactcacagcctgaaccctctgctctcccacggctccccccaccaccagcccagc tcctctcctccaaacaagctgtcagcactacgatgtgttgaatgcttaccaagtcctggg aatatgctagtgatgtacgagctgcgcgtcctgccctctgggaggttactgtctgctctt ggccagcaggagctgtccctggttataggctccatgacagcccattctgaagccattgag caggtctggagtgatggcctctacctcccttctggagatgaatcaagagaaaggccccta aaaggagtgcgctacagccagcagggaaacaatgagatcacttcctcctcaacaatcagt caccatggcaacagcgccatggtgaccagccagtcggttttacagcaagtctccccagcc agcctggacccaggccacaatctcctctcacctgatggtaaaatgatctcagtctcagga ggaggtttgcccccagtcagcaccttgacgaatatccacagcctctcccaccataatccc cagcaatctcaaaacctcatcatgacacccctctctggagtcatggcaattgcacaaagc ctcaacacctcccaagcacagagtgtccctgtcatcaacagtgtggccggcagcctggca gccctgcagcccgtccagttctcccagcagctgcacagccctcaccagcagcccctcatg cagcagagcccaggcagccacatggcccagcagcccttcatggcagctgtgactcagctg cagaactcacacatgtacgcacacaagcaggaacccccccagtattcccacacctcccgg tttccatctgcaatggtggtcacagataccagcagcatcagtacactcaccaacatgtct tcaagtaaacagacatttcaggagtctggcaatagactcatttctgcgatgatgaggatg attattacactcagaacaagagtaaaaaataagttcacttgccatattccactcaggaaa cgttcttccaacaagttgaaagagaaggaggaatgggaaaggcacagtgaagtaaaatga >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_5|146_aa MLLLQLLLPLPPLLLLLFSVSLCCPGWSEVGMEVKPGLPSHNSLPQPMADGHPPRALQPW HKDTLGPEGSCKVWFAWKELFQVEEAADKETEVQSVSLPKVTSEKQQRQVSTQIGLTPSP MLIPCGTCLSAGTENQGKLYLNLNPI >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_5|441_bp atgctgttgctgcagctgctgctgccgctgccgccgctgctgctcctgcttttttcagtc tcactctgttgcccaggctggagtgaagtgggcatggaagtcaaaccaggtctcccttcc cacaactcgctgccccagcccatggcagatggacatcccccaagggcattacaaccatgg cacaaggacacccttggtccagagggaagttgcaaagtctggtttgcctggaaggagctc ttccaggtggaggaagcggcagataaagaaactgaagttcagagtgtcagcttacccaag gtcacatctgaaaagcagcagagacaggtttcaacccagattgggctgactccaagcccc atgctgattccctgtggcacctgcctctcggctggtacagaaaaccagggaaagctgtat ttgaatctcaacccaatctga >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_6|241_aa MDKFLDTYTLPRLNQEEVESLNRPITSSEIEAIINRLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQTIEKEGLLSNLFYEASAILIPKPGRDTTKKENFRPISLINIDAKILNKILA SRIQQHIKKFIHHDQVSFIPRMQGWFNICKSINIIHHINRTKDKNHMIISIDTEKAFHKI QHPLVLKTLNKLRIDGTYIKTIRTIYDKPTANIILNGQKLEAFSLKTSTRQGCPLSPSYS T >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_6|726_bp atggataagttcctggacacatacactctcccaagactgaaccaggaagaagttgaatcc ctgaatagaccaataacaagttctgaaattgaggcaataattaatagactaccaaccaaa aaaagcccaggaccagacggatttacagctgaattctaccagaggtacaaagaggagctg gtaccatttcttctgaaactattccaaacaattgaaaaggagggactcctctcaaactta ttttatgaggccagtgccattctgataccaaaacctggcagagatacaacaaaaaaagaa aacttcaggccaatatccctcataaacattgatgcaaaaatcctcaataaaatactggca agccgaatccagcagcacatcaaaaagtttatccaccacgatcaagtcagcttcatccct aggatgcaaggctggttcaacatatgcaaatcaataaacataattcatcacataaacaga actaaagacaaaaaccacatgattatctcaatagatacagaaaaggccttccataaaatt caacatcccttagtgttaaaaactctcaataagctacgtattgatgggacatacatcaag acaataagaaccatttatgacaaacccacagccaatatcatactgaatgggcaaaagttg gaagcattctccttgaaaaccagcacaagacaaggatgccctctctcaccctcctattca acatag >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_7|70_aa MQRGQLPRDPPMQGALQPPNPTLKHQVSAARLSSRSSPRVEFYLAAPAKDNSSDNCAFLE RRVRLGGDRQ >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_7|213_bp atgcagagaggccagctgcccagggacccacccatgcaaggagctctgcagcctccgaat ccaaccctgaagcaccaagtgtctgcagcccggctgagctctcgcagcagcccacgagtt gagttttatctagcagcacctgcaaaagataactcctcggataactgtgcttttctagaa aggcgagtaaggcttggtggagaccgacagtaa >gi568815581r:37599076_37844884|GENSCAN_predicted_peptide_8|107_aa MNASPDKTEMEENQTLKKAEQPGDNGRKEALVSSGTVLLTEQGTALAGDVYADFGVWIRR GSWLADGAQAYGKQDDLRVKNKQPWLDCGLLRSHGHQGSIAMDFFDK >gi568815581r:37599076_37844884|GENSCAN_predicted_CDS_8|324_bp atgaatgcctccccagataagacagaaatggaagaaaaccaaactttgaaaaaagcagag caaccaggagacaacggaaggaaagaggccctggtctcctcgggaactgtgttactgaca gaacaaggaacagctttggctggagatgtttatgctgactttggagtatggatcagaaga gggtcctggcttgctgatggagctcaggcctatggaaaacaggatgatcttagagttaaa aacaaacaaccctggttagactgtgggctcctgaggtcccatggccatcagggaagtatt gccatggacttctttgataaatag